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SYNOPSIS 

PliiCOUPLEP MULI'IVAHIATIi: TIME SERIES MORELS FOR 
MULTI SITE ST1U.UMFLOV/S 
A Thesis Submitted 

In Partial Fulfilment of the Roquiremonts 
I'or the Do of 
DOCTOR OF 'PlilLOSO Pill 
by 

to tho 

Department of Civil Engineering, 

Indian Institute of Technology, Kannur, India 


The design and operation of a vnator re source a system 
should bo based on a do a r und c? r s t, an d in cj o If t h ( j p i' o g e s s e s 
affecting them which in turn depend on ]xroper analysis and 
interpret-at ion of data, Paa’ticularly important are tlie 
otroamflow data which exhibit complex si.ochastic character- 
istics. They includes 1. a nonstationary behaviour in 
terms of tho seasonal variation of tho mean, standard 
deviation, serial dependence, etc,; 2* trends due to 
natural or human influences; 5. periodic or cyclic compo- 
nents; 4. autoregression or persistonco within the time 
s o r i os; 5 . mo ving a ve rage or ex t or n al e o r r e 1 a t i o n b o t w e o n 
random components at the site ; and 6. (^niu'elatiou in space 
or among time series, Grene:i:ally facto;r'J 1 to 5 are studied 
in tho univariate modelling of a time sorloa, say, the 
streamflow at a site and factor 6 is studied in tho multi- 
variate modelling of several time serios. Knowing tho 
dopondonco or indepondenco of stroamflows at difforont sites, 
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it is possible to design and operate multircsorvoir systems 
by taking advantage of the interdependorico , 

Mathematical modelling of stroamflows at a site is 
itself fairly complicated. Modelling of multisitc stream- 
flows is further complicated by the fact that the multi- 
variate streamflow variability may bo attributed to i. the 
within-the -station variations in terms of factorvS 1 to 5^ 
and ii. the among- the -stations variations in terms of the 
0 r OS s o o r r o 1 a t i ons b o two o n the s t r o amf !1. o w .s a t d i f f e r o n t sites. 
Univariate models in hydrology deal with the former and 
multivariate models generally consider both simultaiioously . 

It scorns possible to decouple the variability due to those 
two cl os BOS so that tho within-the -sta,ti on variability can 
bo first estimated by univariate model,ling and used for 
identifying tho caiserially indoix'ndent random coinponents at 
each site 5 and then these random components can. bo related 
through a multivariate model to account for the inter-station 
corrolabions and the among the stations variability. iiuch 
an approach s coins to bo advantageous because of the following: 

1. The exi)Grtiso developed in tho post in univarirnte time 
series modelling can bo boneficially used. 

2. As tho ■ .serially independent and normally distributed 
variates are used in tho multivariate modol, a better por- 
formancG of the modol can bo expected. 
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5. Sinco the paramo teie aix) estimated in stages , the 
numb or of p ar am e t e is os t ini a t o d in each stage is c oni p a rab 1 c 
to that of univariate and multivariate model rcspcctivoly 
and the problem of simultaneously estimating all the para- 
meters is avoided, 

4. Where data lengths vary from sto-tion to station, 
all the available data can be used in the estimation of 
paramdtors and decoupling may eliminate' errors and bias 
that may bo present in the simultaneouf:) estimation of 
parameters , 

5. Decoupling of spatlaJ. and tempo .ml variations has 
boon suggested recently by Yovjovich, I'lcjia and Iturbo, 
and Yevjcvich and Xarplus. Rather than using simple spec- 
tral and correlation procedures to reprosont spatial 
variability, the use of multivariato models is suggested 
in this study. This is similar to the appro ouches of frost 
and Clarke, and Yovjevich, 

6. Docouplod multivariate models rioorn to bo very power'- 
ful tools in roprosonting comi)lox multiple interrelated 
time series. 

Without loss of generality, the study was restricted 
to throo sites in a rivor basin in North India for which data 
were available and to a 25 year concurrent record of ot roam- 
flow, Data sorios considered in this study Include i) the 
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tendaily historical data series,- ii) tho monthly data series 
calculatod from tho tendaily scries as the average of tho 
t h re c t e nd a il y valu cs in c ach mo ri t h j an d i i i ) tho annu eO. 
series, being tho average of the monthly values in each 
year. Annual data were normally distributed? but monthly 
and tendaily series were normalised by logarithmic trans-" 
formation. Tho data series were standai'discd to a moan aero, 
unit variance series by moans of tho sample moans and 
standard deviations for each of the pe.r'iods. Using ARIMA 
models and a nonlinear least squares togroasion approach, 
uni vrx ri at o mod al s w o r c f i 1 1 o d to tho s b an da r d is o d n o rm al i se d 
time Bories at.oach site. Tho results indicatod autoregres- 
sive models of orders zero, one and three rospoctlvcly for 
the annual, monthly and tondai3.y series , Tho ro si duals wore 
calculated and tested for serial indopendenco . Time domain 
and frequency domain analyses indicated serial in do pen donee 
when tho full series was considered ? but, for monthly and 
tendaily series tho go ire la t io n c o of f 1 c ion ts b o t wo o n the 
rosidunls of adjacent periods wore found to bo significant 
in several cases. This indicated a seasonal variation of 
porsiatonco and so nonstationary autorogrosslvo models of 
tho same order as before but with son.sonafLy varying para- 
meters were fitted to tho monthly and tendaily sorie,s . The 
residuals from tho nonstationary models wore found to be 
serially unoorrela/bed so as on ally and as a whole. Tho uni vtir into 
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residuals were also generally normally distributed. 

The second stage consists in fitting a multivariate 
time series model to the univariate residuals to take into 
account the aiKiong- the -series variation. Using currently 
available procedures (Matalas, Young and Pisano), a stationary 
first order autoregressive multivariate model was fitted to 
the univariate residual series. The multivariate residuals 
were evaluated end tested for independence serially and 
with one another. When considered as a v/holo» the series 
were found to be .mdependent serially and of one anotheri 
but mont]:ily and tendaily series showed significant cross- 
correlation on a seasonal basis in severrlL periods. This 
suggested a nonstat ioiiciry multivariate uiodol vfith seasonally 
varying parameters. Por the tendaily series, for four sea- 
sons it was not possible to fit nonstatiouary fii^t order 
autoregressive multivariate mode3.s using the procedui'ea 
adoptedifor stationary modelling. This is because of incon- 
Siatencios in the sample coarrelation matrices and the problem 
was similar to caBos mentioned by Matalas and Wallis. Y#erooth 
order models were fitted to the tendaily serioa in such cases, 
bevolopraont of procedures for fitting first order models and 
for estimation of parameters in such cases is beyond the 
scope of this study. Monthly data did not exhibit any such 
problems. The multivariato residuals from the nonstationary 
model were estimated and when tostod statistically, they were 
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found to be independent serially and of cne another, sea- 
sonally and as a whole. 

It is possible to re couple the multivariate and 
univariate models and also use inverse transformations to 
denormalise and destandardise them in the proper order so 
that the complex relationships between the actual flows 
can be derived in terms of the parameters estimated earlier. 
These relationships can be used to predict missinfT; data if 
any 5 forecast future values of the streamflowa at any site? 
£md generate multi site sti’ej^mf low data for simulation of 
complex water resources systems. 

One step ahead forecasting and predictions were 
considered in this study using the multivariate model deve- 
loped earlier. It was seen that generally multivariate 
estimation and forecasting leads to a hotter definition of 
the expected value and confidence levels of the vai'iable 
concerned than in the case of univariate models. 

On tho basis of this study, the following conclusions 
can be mades 

i) Decoupling facilitates the use of aimplor componont 

models in the modelling of complex stochastic processes, and 
leads to a bettei- understanding of the temporal and spatial 
dependence between streamflowa at different aites. 
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ii) For the standardised annual series, a zoroeth order 
univariate fciutoregrossivo model and a first ord or multivariate 
autoro^^resaive model are indicated. For the noimalisod, 
standardised monthly series, a nonstationary first order 
autorogrcssivo univariate and multiveiriate model are indi-* 
Gated, Tendaily data after normalisation and standardisation, 
can be represented generally by a third order autoregressive 
non stationary univariate modol and a first order autoregressive 
no na tabionary mul/ti variate model. For some seasons, the sample 
correlation coefficients for the none tat ionary modol were 
inconsistent with the assumption of a multivariate first 
order autoregressive model. In such cases, a zerooth order 
multivariate model was fitted to the tendaily series. 

iii) Multivariate models can bo used fox" the estimation of 
missing values, and for one step-ahead forecasting. They 
generally lead to o. better definition of the conditional 
expectation and a smaller standard deviation of the forecast 
or estimate. 

iv) Data can bo gonoratod on the basis of the docouplod 
multivariate modol and use In simulation EOialysis of multi- 
reservoir systems. Decoupling is a very powerful tool in 
the mathomatioal modelling of interrelated tine sories and 
tho procedure developed herein can be used for modelling 
other multivariate processes. 


1. INTRO me *rioM 


1.1 General 


The design and operation of a water reeiouroes system 
should be based on a clear understanding of the processes 
affecting them which in turn depends on a proper anaJ-ysis 
Slid interpret.'i.tion of data. This prol-ylom is becoming 
increasingly iiii|)ortant owing to tho necessity of opl;im/illy 
utilising the liiuibed available resources of the earth 
particularly because populotion and per cax>ita demand are 
inoi’easing exponentially. Historically a number of proce- 
dures have heeu developed and applied hydrologists and 
watoi' resources engineers to quantify and evaluate the 
availability and variability of water resoiirces in time and 
space. These Include (i) mass curve and related analyses 
of historical data (ii) empirical corrolation coefficients 
( i li ) d 0 1 0 rm i ui s t ic s iinulat i on mod els an d ( i v ) s t ochas t ic 
models. They will now be dealt with biaofly. 

1,1,1 Maas curve and related analyooi-i of historical data 

The traditional method of estimating tho required 
storage capacity of reservoirs makes use of tho historical 
data of streamflowo. Using the mass diagram of inflows and 
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deraai'id‘3, rainiiiiuin storage required is ostimated under the 
conditjon that no vrater shortage occurs for the historical 
sequence of inflows. A modification of tliis method is the 
s e quen t p e ale a Igo r i t h m ( F i e r i ng: , 1967). This i s a. c omp\i t e r- 
orieaited numei'ioal procedure which malces the estimates of 
storage less ambiguous than those ob'iuvuied by the graphical 
procedure , But both proc eduro'-' g^eno^rally make use of only 
the historical I'ecord of s t re nni flows and it is unlikely that 
the same se queue o will bo repeated during the life of the 
structure . 

1 • 1 . 2 rUmp i r ic al c o rr e lat i on r e 1 a t i o - j b h J. p s 

These relationships are used to estimate runoff when 
histoilcal data of runoff are inadequnto. They make use of 
the correlation of runoff with indepondont variables like 
hy drometeorologic al and catchment characteristics, A set 
of three or more variable charts that relate the dependent 
variable i n 'b e rmn of t ho ind e p on d on t va r i ab 1 o a 1 s p rn pare cl « 

F or ex am pi e ^ 1 i ns 1 ey * s coaxial c hai' 'b ( Xiin b ley o t al , , 19 5 B ) 
relates runoff to precipitation, ant oco dent moioturo and 
week 0 .f.‘ t'he y o a v • S uc h g I’Up h leal p .to o o d ur e s a:r e u s o f u 1 
when independent variables are nonl in early correlated among 
themselves and the nature of relationship is not known. 
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1- 1 . 3 33etermiuistic slmulat ion mod elo 

Precipitation forms l;he input to the hydrologic basin 
and, runoff, both surface and subsurface, forms the output. 
Runoff is obtained by the modificntiou of precipitation by 
e vap o r at i on , inf i It rat i on eind su r f ac e and s ub sur f ac e s t o rages . 
An estimate of runoff can bo made on the basis of quantitative 
assessment of these component processes* Fron\ knovjn concurrent 
data of preGi])it/.Atioii and other iiydromcteorologlcal data as 
well as runofJ' da.tn ajid on the basis of an assuTtied mathemuticnl 
model for the has 1 n , l;h e pa r ame t e r s o f th e mode 1 are e s t i ma ted . 
iCn owing the parameters and for any given or assumed precipita-- 
tiO]i the component processes includiri,*; runoff are estimated 
through simulation. The simulation models in use Include 
(i) JJtanford Uatorshed iiioclel (ii) Tin; Utah V/atershed Simula- 
tion model ( i i i ) The Agr io ul tn r al Re;:: o : ' r oh So rvi c e model 
(iv) The Columbia River I'iasin model. Details of these models 
are available in x’eferencos (Nordcnsoii, 1969 5 Clarice, 1973). 

1.1*4 Ct ochas t ic mod ol a 

Stochastic xjroceas models s Stochastic process modola 
in hydrology Include simple bivariate models (Sarialiraed ot al., 
1968), models of finite duration processes (Hamasoshany 1971) ^ 
queuing models (Dangbein, 1956), transition probability models 
(Moran, 1959) p random walk models (dox et al., 1965) and 
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linear stochastic system models (Rao et al. ^ 1971). They 
use mathematical theory to study the processes and to make 
inferences concerning the characterifi hie b of the processes, 
Because only siiux:)ler mathematical models can be analytically 
solved, they are useful only for simple idealised represen- 
tations, Real hydrologic systems are nonlinear and interact- 
ing. Rurthermorc hydrologic processes o,re highly seasonal 
and they also exhibit serial- and c j'o:"scorrolati on that may 
va ry s e as onal ly . Ho jic o analy t i c s t o c ]'j n a t ic mod e Is 1 i s i: e d 
above are of Uni i ted use in studying the stochastic charac- 
t o r i s t; i c s o f s t I'c amf lo w , 

Time s 0 ,] ‘i c s m od els s A s e quo nc o o f valu e s o c c ur ling 
or observed along time is called a time series.. Unlike fre- 
quency analysis where the serial dependence between events 
is ignored, tJoic series antilysis tat;or.' the sequence of 
occurrence of events into account. Time scries may in gene- 
ral have t ho f o 11 o^^i ng c ompo non t a 2 ( i ) Trend s T hi s i.nd ic at es 
the long t e rm d(j t o rni i n i s Li c chan g e ;i. n ib o t i mo a o ri o a an d 
may be duo to natural or human agonciers, (ii) cyclicity s Thir? 
refers to the periodic component that repeats itself at 
definite intervoAs , Annual cycle is nxi example of cycles 
that are Important in stroamflow series; (lii) autoregression 
(AR.)s This refers to the linkage oxlsting bestweon the value 
at a given time with earlier valuGvS of the same series, 



(iv) moving average (MA); This is the external correlation 
betv/een ihe flow and the present and parit random components 
at the site, and (v) the pure random component. Cenorally 
these five factors are considered in the univariate modelling 
of a time series, as, for instance, the time series of stream- 
flows at a site. There is, in additioii, the external persis- 
tence which refers to the linkage between the variable at a 
given time and concurrent as well as earlier values of other 
time s c r i o s « T I'j c s o are c o n s i d o .r o d j.i i i'hc mu 1 1 i vnr i ate 
modelling of several time series . .Knowing the dependence or 
inde].ien donee of stream flows in a multi reservoir systoui, it 
is possible to do* sign and operate the system better. 

l.£^ Statement of the Problem 

Multivariate modelling involves tho representation 
in mathematic-' si of tho variabilities of streamflow at 

each site and among the sites. This is however complicated 
by tho fQ.ct that tho variability of multisito atrcamflows 
may be attributed to (i) tho withln~tho -station variatio.n, 
say, in terms of the MA and tho serially Independent 
ran d om components at e ach site, ( i i ) t h o am o ng- 1 he -s t a t i on 
variation in terms of tho crosscorrelati on between the 
streamflows at difforont sites, and (iii) the Internally 
and externcilly independent pure random component at e£ich 
site. In several earlier studies (Board, 1965 j Matalas, 1967 
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young et al, , 1968 j McMahon et al, , 1972 5 Tentland et al.,1971 
Matalas et al. , 1971 ) they have been considered together. 

In this study it is proposed t:o develop a multivariate inodoj. 
that decouple s the ab o ve va ri a tl ons * 

The problem mriy be defined as follows; 

i) to represent the multisite Btre'-i.iflows by decoupled 
multivariate time series models in v/iiich initially the 
within-the- station component is represented by appropriate 
univariate time series models 5 and subsequently the residual 
independent random conpononts at oacli site that constitute 
* prewhitoned • time series are re3.atof} through a general 
linear multivariate model in terms of the internally and 
e jct e m ad. 1 y in cl e pe n d ent pure ran d om c 0 m po n en t s 5 

li) to test the validity of the model using avEiilable histo- 
rical data, and 

iii) to use the results fo.r one -step-ahead forecasting. 

1.5 Objectives of the Otudy 

The objcctivoa of the study arc as follows; 

i) to fit suitable univariate time soilos models to the 
stretnjiiflow at each site and hence to estimate at each site 
the serially independent random components 5 

ii) to fit a suitable multivariate model to the serially 
independent x’eslduals at each site and hence to ovaluato 
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the serially and mutually independent random componento} 

iii) to -validate the fitted decoupled multivariate model; 

iv) to identify the complex stochastic relationships 
between the multisite streamflows by inverting the relation-- 
ships, and 

v) to use the decoupled model for one~step~alaead forecast- 
ing. 

1,4 {Significance} of the 3tudy 

The proposed approach tor the analyais and stochastic 
njodelling of stroamflows at several s:ites seems to be advanta- 
geous because of the follov/ing reasons, 

i) The expertise developed in the past in univariate time 
series modelling can be used beneficially. 

ii) f^ince serially independent random normal variates are 
used in the multi-variate model, a better performancie of the 
model can be expected than with the serially correlated data. 

iii) u inc 0 t h o paramo t e rs ar c e s t im a-b c d in stage s , th e numb e r 
of parfimoters in the two stages arc comparable respectively 
to that of univariato and multivariate models and the problem 
Q-f simulteuieoualy eetimat-ing all parameters is avoided, 

iv) Where data lengths vary from station to station, all tho 
available data can bo used in the estima-bion of the parameters 
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and decoupliiiij^’ eliminates errors and bias that may be 
pree^ent in the simultaneous estimation of parameters, 

v) Decoupling oi spatir.l and temx')oral variations has been 
suggested by Mejia and Iturbe (1974 ) and Yevjevich and 
Karplus ( 1972 ), Rather than using distanc e^-correlati on 
procedures y the use of a multivariate model is suggested 
to represent spatial variations , The concept is similar 
to that of Frost and Clarke (1973) and Yevjevicli (1975). 

Vi) The procedure enables the most appropriate form and 
order o :[' the uni va ri a 1 0 model to be c 1 1 0 se n i n d e pen d e nt ly 
for each site. The form and order of the scheme for repre- 
senting oxtomal variability arc independent of the scheme 
for rG])rescntin/; Internal variability. This lends flexibi- 
lity to the whole procedure enabling ext'raction of maximum 
information from internal and external correlation compo- 
nents. The decoupled multivariate model is hence a very 
powerful tool in representing complex multiple interrelated 
time series - 

1.5 Ccope of the Study 

The scope of the present study is limited to the 
followings 

i) Data of three tributaries to a major river in Worth 
India are taken up for analysis . 
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ii) The analysis is restricted to short lag models. 

iii ) The available date consist of (a) three values for 
each month y the first being the average of the flows of 
the first ten days of the months the second, that of the 
next ten days and the third, that of the remaining days of 
the month. These data are referred to in this study as 
tendaily flov;s j (b) flows for the corresponding months 
ai’e estimated by averaging the 5 ton daily fXov^s for the 
month and (c) annua ,1 flows are cumuli. tod by averaging the 
12 monthly flows of the year. 

iv) A first order Ah multivariate model is used in this 
study to represent the relation between the univariate 
rCvSiduals . 

v) hstiination of the parameters of the univariate and 
multivariate models is done according to some generally 
accepted procedures 5 and 

Vi) One- step -ahead forecasting' is coiisldered for the proposed 
model. Multistop ahead f orocavSting which is an extension 
of the above is not considered . Fui'thormoro , forecasting 
and ostimatioj'i are tested only for a few time intervals. 
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1.6 Organization of the Report 

i'he atudy is reported in t.id following sequences 

i) Univariate and multivariate models for streamflow sequen- 
ces are briefly reviewed- Ihe proposed decoupler] model for 
multisite stremnflows is presented and compared with availa>- 
ble multivariate models (Ohsipter 2). 

ii ) Details of the preliminary statistical analyses of data 
are reported in the following sequence: nnalysis of histo- 
rical data with respect to its distribuhion and parameters j 
normalisation of the historical data using appropriate trans- 
formations and estimation of the paroineters; and standardisa-^ 
tion of the normalised data to hove zero mean and unit 
variance (Chapter 3 ) . 

iii) fitting of the univariate model to the normalivsed, 
standardised data is reported in terms of a general linear 
stationary stochastic jnodel^ viz. , the autoregressive- 
integrated-moving average (Al^MA) model. Uteps include 
identification of a suitable model 5 estimation of the para- 
meters of the identified model? and diagnostic checking of 
the residuals for independence and nonnallty . The anaJ^y- 

sis is extended to nonstationary univariate models (Chapter 4 ). 

iv) A multivariate model is fitted to the serially uncorre- 
lated resldualo of the univariate model, and the parameters 
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are estimated, The residuals are tested for noimality 
and for internal and external independence. A nonstationary 
model is found to be necessary for monthly and tend ally 
data (Chapter 5). 

v) The components are recoupled to yield the multivariate 
relationships. They are used for mult-i variat e one-step-ahead 
forecasting (Chapter 6). 

Vi) A brief siunmary of the study is givcn^ conclusions 
drawn on the basis of the results of the study are presen- 
ted and some suggestions are made for future investigations 
(Chapter 7). 
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TliViB SIdRIES models IN IJYRROLOaY 


2.1 Introduction 

Gonnider a stochastic process, say Q(t), the streamflow 
at time t at a particular site, the sequ-ence of values of the 
variable at tii)ies t^, t^? ... tj^, via.; Q(tj,), Qit^) , ...Q(%) 
represents a discrete time series refeired to herein as the 
time series. The time series is a sequence of random vari- 
ables wi t h £.m ot s s 0 c i a t e d p ro b b 1 1 i'by d b t I’i b u t io n I'u n c t io n f o r 
each value of t. Since a sequential relationship may be pre^- 
sent b et ¥ e e n the val ue s of the var iab 1 e a t di f f e rent t i m es 
it may be necessary to define the joint probability clistrlbu- 
t i on fu nc t io n B as s oc i a. te d w i th an y a rb i t r /.iry set of t imo s , 
say, t^, ± 2 ^ ...t^^. This concept provides a probabilistic 

description of the process. Time aeries models are used to 
define such relationships and time series analysis is used 
to model the procosvS on the basis of a hiutnrical record 
which ivS only one of the infinite possible realisation b of 
the under lyirq^ stochastic process. 

A time series is said to be strictly statlona.ry if the 
joint probability density function of the fainllios of random 
variables [Q(t^), ...Q(tjj)J and [Q(t 3 _^.j_), 

. . . Q( . ) J is the same for all i for any sat of t-| j t,, 



i,e,y atationarity is aciiieved if the distribution is inva- 
riant with respect to translatio]i in time. This implies 
stationaiity of niomenis of all orders of tlie tniiio series, 
fiist order stationarity implies that the meai'i Is a. constant 
and second order stationarity implies that the autocovariance 
at lag iy viz.y E[Q(tj) where h stands for the 

expectation, is a function of i only. In streamflow analy- 
sis , it is customary to assume weak statlonari by , i.e., 
s t a t i on a ri t y w i th r e s]p ec t to f i rs t aa id rj o c on d m omo n ts . 

I n gene ral , tli c f i is t and s >- ;co nd m omc nt s of flow, v iz . 
mean and vaiiance vary fraii season to season within an annual 
cycle. 111 addition, certain influences of man like the 
c 0 n s t ru c t i on of dam may int ro d uc e n on s b a 'b io n m ri ty in a f lo 
s e ri es . In o r d gi' to si inpl if y th e an a ly si s , 1 1- i s o f t en the 
practice to remove tho nonstation arity from the time scries 
whenever possible and to analyse the stationary residuals 
(Roosner and Yevjevich , 1966f Young and fisano, 196a). for 
example, the nonstationarity may bo due to tho prosonco of 
long term trends and .periodic cycles In the data in which 
case they can be estimated by standard procedures (3 gc. 3,3) 
and eliminated. Tho residuals then cons’tiLtutc a stationaiy 


time series . 
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2,2 Univariate Models 

Mathematical models used to represent a sin^e time 
series are referred to as univariate models. They may be 
broadly classified into short lag and Ion,-:, lag models and 
they are briefly described below, 

2.2.1 bhort lag 

A x5-th order auto regressive (AR) model may be represented 
as 


Q(i) 


P 

T.. 0. Q(t-i) h s(t) 

i-1 


( 2 . 1 ) 


where is the p«th order autoregressive coefficient and 
e(t) is the sei'ially unc orrelate d comxionent at time t. 

A q-th order moving average (I'lA) model may be repre- 
sented ai , 


q 

Q(t) = e(t) +29^ e(t-q) (2.2) 

1=1 '1 

where 0 is the q-th order moving avGrag;c coeffi oient opG.rat- 
4. 

ing on e(t“q)^ the serially unc or rela’be d component at time 
(t-q) . 


The aut or egre s si ve -moving average (AI^JVIA) model consti- 
tutes a genea^l class of linear stationary stochastic models 
and includes the AR and MA models as special cases. A (p, q) 
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order AJRMA model is Al^ to order p and I'lA to order q and is 
described by the equation, 

p q 

Q(t) = i: Q(t-i) + e(t) + E 0. c(t-j) (2.3) 

i=l ^ j=l ^ 

J^lq. (2,3) indicates that the effects of Q( • ) md e(.) persist 
for p and q time periods respectively. Grenerally p and q 
are equali to or less than 3* Models vrhich have such a short 
lag are referred to as short lag models. AR, MA and AEMA 
models have been used extensively in hydrology- For example, 
firs t ord er AR m od at s have bee n f 1 tte d to annual st re am fl ow 
data (Yevd jevic )a« 1964) and to montlxl-y stie^^nflow data 
(Roesner and Tevjevich, 1966). A second order AH model has 
been fitted to noruiaHsed daily s tream;rLo\f data (Quimpo, 1967). 
ARMA models of order (Ijl) and (2,0) ivorc IMtbed by Oarlson 
et al . 5 (1970) to annu£\l stresmflow data. 

The above models are stationary with respect to mean 
and standard deviation and they also assume covariance sta- 
tionarity. These assumptions are relaxed Lmd a seasonal 
variation of the parameters is assumed in a class of univa- 
riate models. They may be referred to as seasonally nonata- 
t i onary un iv ari a te ra o del s . The Thomas -Fi o ring m o de 1 ( 1‘ faas a 
et al. , 1962) was suggested originally for monthly stream- 
flows and is given by, 

= + ■bhQ(t)-Qh + o(-t+l)sqi-(rh^]^ (2.4) 


Q(-t+l) 
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v/here t varies sequentially fraii, say, 1 to N, the total number 

of time periods and j varies cyclically from 1 to 12; (} . , s 

J 0 

r.- and b. respec tivt^ ly denote the mean flow in the j-th month, 

u u 

standard deviation of flow in the j-th month, correlation 
coefficient between tlie flows in the j^th and (j-l)-th months, 
and the re gre s s i on. co ef fie lent b e e e ii t lies e two f 1 o vj s , e ( t+1 ) 
is the serially uncorrelated nomal.ly distributed residual at 
time (tml) with zero .mean and unit standard deviation. As the 
par am etc IS , s^, and vary with the season j , this is 
3 first order rionst ationa ry A.H model. 

A more goneral class of models is ono which is nonsta- 
tion a ly and whose stationority can be aoiiioved by repeated 
differencing of the original series. llioso arc referred to as 
au t o r 0 gr 0 s s i VO - i n to gr at e d ~m o vii:L g a vo rag o ( Ah IM A ) models o f 
order (p, d, q) vrherc d indicf.itcs the order of differencing 
re qu ire d to lo n do r t .h o or ig in al series s ta t io n a ry . AR.TM A 
models have been used to dose rib c the monthly strotimflow scries 
by Me Kc rch ar an d Do 1 1 our (1972). 

2.2.2 Long l-iig models 

Consider the time series (Q(t), t = l,2,...h). Lot the 
partial sum of flows bo, given by 


c(t) = 


t 

2 Q(l) 

1=1 


(2.5) 
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The rarig 0 of cumulative flous ll is defijied as 

R = Max [0(t) - S C(fr)J - fiin [C(-t) - wC(N)j 

“ _ 1 -■■■I- ...I'l 

( 2 . 6 ) 

Range is a function of Q(t) snd. so is itself a stochastic 
variable. It is one of the in(5icators of the storage require-^ 
menta of reservoiis (Riering^ 1967). It can be analytically 
shown ( livelier j, 1951) 'bhat for pure rfindori and AR series, the 
N-year range R(R) is 2^ro porti onal to s * wh ere s is the 
standard deviation of the vo.riablo. liirst (1951) empirically 
ana]ysed natural data and demonstrated that 

li(lJ) = 0.61 (2.7) 

vrhere the Hurst coe.i.'f ic lent U was found to have an average 
value of 0,7^- ■ in the case of streamflow da'ta. The lov7 lag 
AR models do not preserve the Hurst coefficient. Use of low 
lag models then results in an unde r-repro sen ta t ion of extreme 
events (Mandelbrot, 1968). 

Mandelbrot and Wallis (1969) proposed a model for 
na t ur al pr o ce s s o s th a. t c xh ib i t a Hu rs t c o c;f; f i c i o n i: no t e qu aL 
to 0.5. Re a i gna t e d as P rac t io na 1 Craus s i an .Ho i so ( R(tR ) mo d o 1 , 
this is designed to preserve the Hurst coefficient. In phy- 
sical terms, PON has an auto corr elatio.n structure represent- 
ing the oxis'lBnco of a veiy long momor;/ where the distant 
past exerts a small yot vSignif icant influence on the present. 
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Because of the large number of teinis and the long lag const-’ 
dered, they are referred to as long lag models. Tho correla- 
tion between success ive events for these models may be small 
and large lag correlations are even smaller and yet their 
cumulative effect is not negligible, ilie fast FGN model 
(Hand elbrot , 1971) is a simplification of the general model. 

It consists of a low frequency component which is a weighted 
sum of a number of Markov processes and a high frequency 
component which, is made up of one or more Markov processes. 

The b r oken 1 in e m o de 1 ( Garc i a e t al . ^ 19 12) also p r es c rvo s 
the Hutst coefficient and it has been slaom to bo an approxi- 
mation of model , 

The ap plies, tion of fGil models to hydrologic data involves 
the estimation £ind uso o .f the Hurst coefficient. On the basis 
of extensive computer cxpoiiments. Wall is and Mat alas (1970) 
found that its standard error is quite lG\rge and hence '’Ofor 
the size of data records normally encountered in streamflow 
anaiy s e s ^ i ts e s tima to is lx Icc ly to bo qui to u nre 1 iab 1 e, Th e 
presence of such a long term direct and stationaiy dopendenco 
is rather dilTicult to justify. I'urthonnoro , AHMA models have 
been shown to offer a systematic procedure for the represen- 
tation of complex linear stochastic hydrologic processes and 
AHir^IA models can repmsent ever nonlinear effects. Hence thj-S 
study is lestrlcted to short lag models onJ.y, 
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2,3 Multivariate Time Series Models 

The sequence of values of one variahl.e constitutes a 
univariate time series. Sequences of values of a number of 
variables constitute a multivariate time series. The modell- 
ing and analysis of multivariate time series should take into 
c onsi deration not only the serial dependence within each of 
the time series but also the mutual or crosscox^r elation among 
th e t irn e series. Mxi 1 1 1 va r i a te models in h y d ro 1 o gy gen ora, 11 y 
belong to one of tho followmg categories: 

2,3.1 Short lag models 

In order to xx^duco the d imens ionali/by of the problem on 
the basis of dependence among the variables s, Fiering (1964 ) 
proposed a principal component model for multiple time series. 
It was shown by Mat alas and acknowledged by Fiering (1967) 
that the abo-^ approach does not presei-vo the correlation 
structure. Torraniri (1972) suggested canonical correlation 
for app lie a.'l: ion in mi il t iv n.r i ate b y dr o '1. o gic mod cl 1 in g; . al aa 

(1967) pvopoani .’t multlvarh-it-. . f'\ j:st or'dux^ Af?. I'lodol given by, 

{Q(tf,l)} [A] + [I'l (^.G) 

whei'u if3 a vectoi’ oj’ jx vuriablv..-!.j , say, stre amflovra 

at K al at time tf is a vector of K aorirlly and 

mutually indepaxdent variables at time t oach with zero mean 
and unit variance, and [AJ and [B] arc coefficient matrices. 
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The components of '^Q(t)j may also represent different 
processes at different sites at time t, ^oung and Pisano 
(1968) combined the univa-riate standardis ation procedure 
with the Matalefi model and proposed an algorithiTi for unique 
estimation of [B] assuming it to have a lower triangular 
form. A model that lends itself to disag-gregalion v/hile at the 
same tine preserving the stochastic characteristics at the 
different levels of disaggregation was formulated by Valencia 
and dchaake (1972), It consists in fir^t developing a suitable^ 
annual f 1 o w mod ol w hi ch may be s t at io n d ry , The di s ag g r e g a ti on 
model then develops seasonal flows thorofronu The model due 
to Beard (1965) consists of multiple regression equations 
connecting; the f3.ow at each site to the pix.)vious flows at all 
s i te s i nc lu d in p; 1 1 s c If and curr e n t f 1 o v; s at. other sites, A 
set of such multiple regression equations, one for each site, 
constitutes the multivariate model. 

As in the case of short lag univariate models, the 
short lag m ul t i va r 1 t e m o d e3, s ca nn 0 1 a jp are n t ly p re s e r* ve t h e 
Hurs t c o ef f i c 1 0 n t , 

2.3.2 long lag mo del s , 

Long lag models are so designed that they preserve the 
Hurst coefficient at each site. Mat alas and Wallis (1971) 
have extended the univariate PG-N model to two sites. Adopting 
the Type 2 approximation of PG-H, they u.so the data to find 
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the mean, variaice, skew coefficient, first serial correlation 
c 0 ef f ic i ent an d ilurst co ef f i c lent at e ac h s i te . Th en th e 
crosscorrelation model is set up expre soin;- ; the crosscorrela- 
tion between the flows in terms of the above statistics, Ihe 
multivariate broken line model (Mejia et al , , 1974) uses the 
lag zero crosscorrelation matrix betv/een t]ie flows at different 
sites. The lag one crosscorrelation is not preserved. It is 
an approxijnatiou of the .FGN model and details have been deve- 
loped for only a two variato model. -*-'ong lag models are 
reviewed briefly above in order to present the st ate -of -art 
in role vent but different approaches to multivariate model- - 
ling. They are at present (1976) lliiiited te only two vari- 
ables and are beset with paiametor estimation problems. They 
are hence not used in this study which is restricted to the 
c onsid oration of short lag uni- and multivariate models only, 

2.4 Decoupled Multivariate Models 

2.4*1 t^ecd for a decoupled approach 

Let the multivariate vector of stroamflows be given by 

( "fc ) ^ ~ ^^^1 ^ ^ ^ ? ^2 ^ ^ ^ > • • • '‘^IC ^ 
where Q^^(t) is the streamflow at site k at time t; K is the 
total number of sites and T stands for the transpose. To 
represent the relationship among the time sex'ies , a general 
(p, q) order multivariate ARMA model of the follovring foim 
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may be considc C’ed. 

= C -^"1 I * + Ca,„ j ., ( 1 ' 

+ {:':(t+l)} +.,.-i- 1,< J {,3(b-)2-.q)y (2.9) 

where [A^^], i ~ 1,2., ...p ore the AR go olli cient matrices and 
[S-J, i = 1?2, . q are the MA coefficient matrices , all of 

J 

dimension IC x IC, where K is the member of sites considered. 

This equation can be considered as an extension of the univa- 
riatf- . ARMA model to the multivariate doiiialn. VJith adequate 
data, this equation can be solved for tho unknowns. But 
because of the large number of parameters, the results arc 
likely to be data dependent and unstable. 

The complication in multivariate modelling is due to 
the fact that the variability of multi si to streamflows is 
caused both by the internal correlation of flows at each site 
and the external correlation among tho flows at tho various 
sites. It seems possible and desirable to decouple the two 
and deal v/ith them separately, A univariate stochastic 
model ma^ first bo fitted to each of tho flow scries thus 
accounting for ‘the in'bcrnal oorrelation. The resulting 
residuals are referred to as prewhitened series since they 
are serially independent and normally distributed. A multi- 
variate model can be fitted to the univariate residu&il series 
in terms of serially and mutually uncoirelated random compo- 
nents to explain external correlations. 
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Such an approach for the modeD.lirid;; of multivariate 
time series seems to be advsivbage ous because (i) it enables 
the utilisation of techniques of univariate time series 
analysis, which aie quite advanced j (ii) as the parameters 
of the univariate and multivariate models are estimated 
separately , their number is comparable to that of univariate 
and multivariate models respectively and the question 

of simultaneous estimation of all the parameteis is avoided, 
and (iii) it affords flexibility in the choice of appropriate 
but different models for the univariate and multivariate 




time series . Hence the decoupling ax^proach seemvS to hold 
promise of wide application in multivariate modelling of 

'i i-i -w’ o 1 Q .n o ci t't P Q 

2.4*2 Other deco uple d mod els 

Kar e li q b i s ;a'Kl Oho w (1 97 2 ) sugge a to rj a px’o o e d ur e 
I or th o an aly E a s o f ro f5 i d u a 1 h.y d .to 1 ug i o ' s t oo h as b i c 
pr u o e Ej wo vi whi o ii m ay bo c o xi tii (i o r d as a s pe o 1 al o asc? 
of the deof up led modol. 

1 t: V J « V rcii anu x\.cl.l’pxu»3 (Xyi^-/ Ucrivo- JLIJ i/J, U uuuc (.1 UiiC 

decoupling approach by first determining the univariate 
r e 8 i du aL 3 at each site an d conn oc t in g th ces e r e si d ual s by 
distance -correlation equations. Frost arid Glaiico (1973) 
fitted first order All models to the s brcernflow at two sta- 


tions, and a multivariate white noise model, to the residuals. 
Their model is e quivalent to assuming a diagonad. form for 
the [Aj matrix in Bq, (2,3) and estimating the corresponding 
[B] matrix. Recently Yovjevich (1973) has fitted second order 
AR models to the net basin supplies of four of the Great lakes^ 
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and a multivariate white noise model to the univaria'te 
residuals. The decoupled multivariate models developed in the- 
QQ studies are stationaiy models, in which the correlation 
structure is assumed to h e invariant with respect to time. 

2.4.3 Methodology proposed for the decoupled model 

The mathematical modelling of multi site streamflow 
sequences by the proposed decoupled multivariate model 
invo Ives the f o 11 ow in g s t e ps s i ) no im al, i, s at i on of s t re am ~ 
flows at each station by logarithmic , square root or other 
transformations as appr-opriate j ii) statidardisat ion of 
streamflows at each site to achieve stabionarity in mean 
and standard deviation 5 and iii) fitting a univariate model 
at each site to separate the persistence and serially inde- 
pendent components. It is proposed to use only short lag 
models in this stud 3 '" , ISuitable models include those of Box 
and Jenkins (1970), Thomas -Piering (Maass et ai, , 1962) etc. 

Por example, the univariate ARM A model is given by, 

x(t'hl) = 0 ^ x(t) + x(t-l) “I- ...+ 0^ x(t+l-p) 

-h E ( t-f 1 ) 'h 02 t ..d'H 0^ e( t-h 2 — q ) 

( 2 . 10 ) 

where x{t) is the normalised, standardised flow at time t, 

e(t) is the serially independent component at time t and 0^ 

and 0 ■ are respectively the i-th order AR and j-th order MA 
J 
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coefficients. This may be ex'l:ended for a number of sites as, 

{x(l;+l)} = [6f^] {X(t;i} + ... + [0^,] {x(t+l-p)Tf 

+ [e^J .|^E(t+l)]- +...+ [6^J .^e( t+2-q)'^|- (2.11) 

where •rx(t)^ is the vector of normalised standardised 
streamflow series at time t, ■^s(t)^ is the vector of the 
univariate residuals at time t and [ j and [© •] are 
diagonal matrices respectively representing the i-th order 
AR and j~th order MA coefficients of the univa.riate models 
at the several sites under study. In B’qs. (2,10) and (2,11), 
the univariate models are assumed to be stationary. The 
model can also be modified appropriately to account for 
nonstationarity when seasonal variations in the correlation 
structure are talcen into account. After the univariai;e 
models are fitted, the aerially independent random components 
at each site may be tested for their randomness and distri- 
butionj iv) Let '[^(tj^ be the vector* of univariate residuals 
obtained by standardising the '^s(t)^ vector via,, 

■{j3(t)’^ = i(t)/aJ] (2.12) 

Where is the standard dQvd.ationo of univariat^-^ roBldua.lo 

for soaDon j* Lot ^'n(t)^j be a vector of pure random compo- 
nents at time t which are free from serial and c rosscorr elation. 
Then |E(tj}j may be related to (p?l) order multi- 

variate ARMA model. In this study, the simpler Matalas model 
corresponding to p = 1 and q = 1 was found to be satisfactory, 

* 9 


Viz 
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{y(t+l)} = [CJ ■[j'Utjll + [D] {.Kfc+l)] (2.13) 

where [0] and [ I>J are the coefficient ma'tricee , Depending 
on the nature of streamflow data, [Cj Jind [Dj may be constant 
coefficient matrices that do not change with time or season 
and hence are stationary j or because of seasonal variations 
in cross correlation structure, they may vary with season or 
time and hence are nonstationary ; v) Dy x'e versing the steps, 
the coupled multivariate model can be derived. It may he 
shovJii from Eqs. (2,11) and (2,12), that tJie coupled relation- 
ship is o f the form, 

^X(t4-l)^ = +•••+ L^;,] “^^(t+l-p)^ 

[Qi] [c] +...+ 

+ [QqJ [cj [u(t+i-a)'^ 

+ [0iJ [ja] ■f'i(t+i)'^ + ... 

+ [eq] |-J,j (2.14) 

In the right hand side of Dq, (2.14), the first p terms 
represents the AR component with reference to the earlier 
values of the aaiac time series, the next q terms represent 
the c ross correlated moving average terms among the serially 
independent random components and the last q terms represent 
the cross correlation among the pure random canponents, I'br 
the case of stationary univariate first order KR models at 
all the sites, p = 1, q = 1 and - [f ] y so that ]j5q, (2.14) 
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becomes 


^X(t+l)^ = [G] {E(t)} 

+ [D] fn(t+l)]f (2.15) 

iCnowiiig the mean and standard deviation o:C the transformed 
flows and f or the season •|}L(t+l^ may be 

destandardised with diagonal matrix with olcmonts 

w O’ 

{t(i+i)} = {yj+h + [^<il f-(W} 

+ [G] {;.;(!;)) 


+ [s^'*'h [sf-^] [D] {n(t+i)'5 


j+1 


( 2. 16 ) 


Depending upon the normalising transformation used^ the {Y(t?j 
series can be trivis formed back, to the original '^Q(t)^ series. 


The proposed decoupled multivariate model was developed 
independently of Prost and Clarke (197!^) and Yevjevich (1975) 
and differs from the earlier ones in a number of features , 
viz. , 


i) The decoupling of the within~th e-otation correlation 
and amo ng-the -station correlation as well as pro whitening 
of the univariate series are specifically identified,* 

ii) Different univaria-be aid multivariate models may be 
fitted to the time serioB as appropriate ,♦ 

iii) The recoupled equations are derived in terms of the 
decoupled models o.nd 
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iv) The model is formulated in terms of ([^oneral non- 
stationary univariate and multivariate models to facilitate 
inco rporation of seasonal variation of witlun-the-station 
and am ong -th o -s ta Irl o n c o rr e 1 at i o n s , 



3. :™^LIMINARY MALYSIS 01? I-IYDUOT.OGIC DATA 


3.1 Data Used for the Study 

Streamf low records for three major tributaries to a 
river In North India were available for the study. The avers-ge 
daily flow was expressed in cusecs and there were three values 
per month corresponding to the first bcni days ,, second ten days 
and th e re mai nd er of the mo nth . The s e ar e ref a r re d t o a s 
ten daily flows. Monthly avero^ge flows viere estimated as the 
average of the three consecutive ten d.'iily flovr values. Simi- 
larly average annual flows were estimated as the average of 
the t\/elve monthly values. The period over which concurrent 
data were available was 25 years only, liven though records 
vrere available in some stations for a longer period, only con- 
current data were used in this study. 

3.2 1‘requency Analysis of Streomflow Data 
3.2,1 Normal is at i on of d at a 

Noimal distribution for tho randoia varLablos confers 
some mathematical advantages. It is well anaHyced and documen- 
ted. Also, by virtue of its linearity ^ vi 2 i , , that a linear 
combination of normally distributed variables yields a normal 
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variable, it if5 amenable to algebraic raauiyjulation ♦ For 
ins tone e , b iv ar ia t e re gre s s io n and co I're la t io n mo del s s uc h as 
the Thomas -Fie ring modol(M:ias;:j ot al, ,196 2) are baaed go no rally 
on the assumption of normality. Another special feature of 
the normal distribution is with regard to the station ari by 
of the underlying stochastic process. Strict stationarity 
requires that the moments of all orders remain time invariant. 
But the estimation of moments of high orders is unreliable 
especiaJJ-y if the sample is short, Ilonco satisfying the 
requirement of sbationaiity becomes difficult. But if the 
underlying distribution is normal, stationarity of the first 
two moments implies strict stationarity. Estimation of the 
paraiiieters from these two moments can bo expected to be 
reliable. 

Stre amf lo^^r data can have only nonnegativo values and 
further the resulting distribution is generally skewed. In 
such cases, a skewed distribution may have to be fitted to 
the data. In order that the advantages ox the noimal distri- 
bution may still be utilisod, the skewed natural flow series 
may be transformed to a normal distribution. This is the 
basis of normalisation of data. The procedures generally 
adopted for normalisation include (i) N-tli root tranf3formation 
(Stidd, 1970), (ii) logarithmic transformation (Board, 1965? 
Young et al . , 1968) and (iii) Hilferty-V/ilson transf ormation 
(Beard, 1965 ) for gamma distributed data. 
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(i) N-th xoot trauGformation involves the derivation of the 
N-th root of the observations so that the transformed series 
follows a normal distribution, Squai-e and cube roo'b transforma- 
tions have been found to help in normalj.sation. They were not 
found to be valid for the data used in this study. 

(ii) Logaiithmic transf o mation (dhoiv", 1964) involves the 
consideration of the natuial or common logarithm of the vari- 
able. When the lower bound of the variable is nonzero, the 
probability distribution is referred to as a t hire e parameter 
3.ogarithmic distribution. In case one or more observations 
lie at the lower bound, it may be necessary to add a constant 
to alJ. the observations in order that the logarithms are 
finite. for the ds.ta used in this study, the additive term 
was not fcund necesstiry. Logarithmic trariaf.’o rmation has been 
found to be very useful in transforming highly skewed and 
nonnegative variables like stream flows to nearly normally 
distributed ones, 

(iii) If the original distribution is a gamma distribution 
(Pearson Type '3 distribution), it can bo transformed to a 
normal distribution, by the Hilferty -V/ilson tronsfo rmation 
(heard, 1965). ^fho procedure consists in first standardising 
the raw series so that the resulting series has zero mean 

and unit standard deviation. Lot be the Pearson Type 3 
standard deviate obtained fran the st and ar di sed data with 
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skew coefficient g. Then the corresponding normal (O, 1) 
deviate t^ can be obtained by the transformation 


- ' [(- 
g L , 


+ 1 ) 


1/3 


- 1 ] + 


G 


(3-1) 


In case the probability distribution is a log Pearson Type 3 
distribution^ it is necessary to use the logarithmic transfor- 
mation, standardis ation and then Hilf erty-Vfilson transforma- 
tion. 


3.2.2 P i ti; in g a pr o b ab il it y d i s t r i bu t Ion 

Probability distributions may bo fitted to random 
variables on the basis of available observations- This involves 
the assumption of a suitable the ore tic nl model, estiEiation of 
p ar amet e r s and t e s ting th e good nes s o f fit, 

-Estimation of parajneters: The parameteiB may be 
estimated generally by one o.f the three following; methods 
(Gnow, 1964; Yevjevichy 1972); 

i) Method of moments (MM); Melhod of moments relates the 
sample values of the moments to the pa3'amt:5ters of the dis- 
tribution. The r-th sample moment abou'b any arbitrary x(0) 
is given by, 

= -Y" S [x(i) - ^(0)] (3-2) 

i=i 
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where h is the size of the sample, For any distribution, 

the fiist moment 'about the ori^’in gives the sample mean 

and the second moment about the mean gives the sample variance, 




and 


X 



ii 

^ x(i) 
i=l 


(3.3) 


Var = 




N 

S 

i=l 


( X ( 1 ) - x ) 


(5.4) 


— . p 

X and Var (x) are the sam])lc estimates of p and o for the 
theoretical distribution. 


Estimation by the method of moments is asymptotically 
efficient and the efficiency is usually smaller than unity. 
For small samples, the estimates are significantly affected 
by extreme or off-control points that may be present in the 
sample . 

Let \i and cT be the mean and standard deviation of the 
1 o gno .rill ally die t r i bu te d v ari a ble . Us ing 'bl i e Mil , th e mean 
l-L(n) and the standa,rd deviation ci(n) of the nortaalised 
variab le are given by 


p(n) 

= In 

(5.5) 


p +0 


a(n) 

= [In ( )]^ 

(3.G) 


1 - 1 - 
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ii) Method of least squares (ML3); This inettod consists in 
the estimation of the parameters of the assumed distribution 
by minimizing the sum of squares of deviations of the observed 
points from the fitted function .Chow (1964) suggests the 
following procedure 3 If x and s are the sample mean and 
standard deviation, the frequency factor K is defined as 

K = (3.7) 

X 

For any given distribution, iC can be related to the cumulative 
distribution function F(x) of 'die di strilnition. For a given 
F(Xj) is estimated fran plotting pos'ition formulae. For 
the assumed distribution, IC is obtained fran F(x^). The 
regression 3. ino of x on K is determined by ML, 9, This gives 

estimates of sample mean x and s-brndard deviation s . Though 

A 

this procedure is not theoretically exact, it generally gives 
a better overall fit than the MM, and further, the estimate is 
not affected as much by extremely rare occurrences as in the 
case of MM, 

i i i ) Method o ;t“ maxi mum 1 ike 1 i ho o d ( Mi ) : Max d .mum 
likelihood estimate is that estimate o the pa,rameters of a 
distribution for wliich the probability of occurrence of the 
actual observations is a maximum. 

Let x(l), x(2), x(h) be the .IT observations and 

f(x), the assumed probability density function in terms of the 
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parameters p(l)y p(2)y ,,, p(J). As&umriig independence of 
eveiits, the probability of the outcome p is given by 

p = nr f[x(i)] (3,8) 

i=l 

Por this to be a maximum, p/3p = 0 for all j = 1,2,...J, 

The J equations in terms of the J parameters are solved to 
give the maximum likelihood estimates, I'or the noirnial dis- 
tribution, these e,stimates are the same as the estimates by 
the j'#i, for the t\’^o parameter lognormal distributiony the 
ML'fi of mean and standard deviation are given by the inean and 
standard deviation of the logarithm of the sample, - 

Test for the goodness of fit; In tlie present study 
the goodness of fit of the fitted disixibution to the observed 
data is tested by the chisquare test. The sample space is 
divided into I mutually exclusive dosses with a class fre- 
quency of 5 or more, Xet p(i) be the probability that the 
variable belong3 to the i-th clasvS for the assumed distribution. 
If x(i) and x(i+l) ar'e the limits of the i-th class interval, 
then 

p(i) = P[x(itl)] - P[x(i)] (3.9) 

Let f(i) be the observed frequency of the sample from the i-th 
group. If N is the total number of samples, the chisquare 
statistic is given by 
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i=l Np(i) 


( 3 . 10 ) 


If J i;i the number of parameters estima'tod^ then theorist.!- 
2 

cally '70' has a chisquaie distribution with (I - J - l) degrees 
of freedom. Let (o:) denote the value of Ot. at a'% confi- 
dence level for the above degrees of freedom as obtained from 
the tables. If the calculated % is gi\-ater tha]i the theo- 
retic al valu e , t h e n th e s am])le do v i ato s s i yii f Ic an tly f r om 
the assumed distribution at the given level and the fit is 
re je G te d . If it i s 1 es r.i y t h en th e fit i s ac c ep te d , 


In the present study, the transformed standardised 
data wore teste d for normality. The rsnge of the distribution 
was generally (viz., for all except in the case of the annual 
so'i.'ies ) divided into six classes. They are respectively 
~(?Q to -1.0, -1.0 to -0.5, -0.5 to 0.0, 0,0 to 0.5, 0.5 to 
3.0 and 1.0 to (7o , Pi’obs.bility distribution function values 
were obtained from statistical tables and they were used to 
estimate the probability values for the class intervals, A 
value of p ™ 95^) was c ho son as the c on fi donee level. 

Annual discharges? The mean and standard deviation 
of flows wore Gsti'oaiod by MM (Table 3.1). f'or all the thrce 
rivers, the hypothesis of normal distribution was accepted 
at 95^ level. 
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TABLE 3.1 ESTIKlATES OP AMmi 
NORMAL DISTRIBUTION 


n, 0\JS 


(mi) FOR 


RIVi.,R 

1 

2 

3 

7 , ^ (EiJTIMATJiD) 

5.3 

2,1 

10.0 

DEGREES OP PRETiDOM 

10 

4 

9 

( THEORETIC /i) AT 




95% LEVEL 

18.3 

9.5 

16.9 


Monthly discharge si The strermTlo\i data corresponding 
to each month were tuia lysed for freouoncy distrihution using 
parameter estimates by IM. The untransformed data were tested 
for normality (Table 3.2). Several cases were found to be 


TABll'i 3.2 EB!l'lr;A'.nEa OP MOlWHM H,0\iS (MM) FOR MOML 

IjiSTRIBUTION . 

(DEGmSS OP FEEBEOM = 3, pci = 7.8) 


MONTH 

.RIVER 

1 

2 

3 

4 

5 

6 7 

a 

9 

10 

11 

12 

1 

1.3 

0.3 

3.5 

11 . 6 + 

18 . 1+ 

0.8 12 . 0 -' 

■ 7.1 

8 . 7 + 

3.3 

2.1 

1.0 

2 

1.7 

1.3 

3.3 

1.3 

9.9’^ 

3.1 0.7 

0.7 

8.1"'' 

1.9 

0.8 

a.8''' 

3 

1.0 

2.6 

1.5 

3.9 

9 . 9 + 

9 . 9 * 1.5 

3 .4 

0 , 6 

4.0 

3.2 

13.3'^ 


+ Indicates significant values 

significantly different frcni normal distribution at 95 jr:'. lovol,' 
Hence the hypoth^is of normality of the original data was 
rejected. Several procedures for normalisation including the 
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N-th root transf o iraation , Hilf re ty -Wilson transformation and 
logarithmic tranaf o mation were tried. Based on results, 
logarithmic transformation was adopted in this study, Sttmdar- 
disation (Subsoc. 3.3) was done using paraineteis estimated 
respectively by MM, MIS using Chow’s freouency factor and 
MIE (Table 3.3). The goodness of fit was tested in each case. 
Only one value by MM was found to be significant at 95^ level 
for river 1; all values were insignificant for river 2 and for 
river 3 MLS had 3 si g;nif leant values and MLE 2 significant 
values. However all except one of the above values were 
insignificant at 95^ level. Hence lognomality was assumed 
to be valid whatever be the method used for estimation of 
peiiameters . 

Tendaily discharges; Streaniflow data were grouped into 
36 tendaily seasonal values , Using parameter estimates by 
MM, chisquarc test was performed to test normality. The fit 
was found to be not good at 95 ^ level f o r 15 ? 6 and 11 periods 
respectively for rivers 1, 2 and 3, and so the assumption of 
nonnality was rejected. The data were then logtrans formed 
using paiameters estimated respectively by MM, MIE and MIC 
and chisquare test was applied. The results are shown in 
Table 3.4. It is seen that logtransfo nnation improves the 
fit significantly and the 3 methods of standardisation are 
comparable thou^ individual variations exist from river to 
river. The fit of log-Pearsaa Type 3 distribution was also 
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TABUi; 3.5 ^ BSa’IMATBS OF LOGTRAUiTJFOEMED MONTHLY 

SBILIES FOR NORMAL DISTRIBUTION 

(DUUIiELS OF FREBDOM = 5, = 7.8) 


MONTH 

Mli'EiOD 

1 

2 

5 

4 5 

6 

7 

8 

9 

10 

11 

12 





RIVER 

1 







im 

0.7 

1.0 

0 , 6 

4.9 9.2+ 

0.2 

3.2 

5.8 

1.2 

1.0 

4.5 

2.5 

MLB 

0.7 

1.5 

0.6 

4.0 6,2 

0,2 

2 . 6 

3.5 

1.2 

1.0 

4.5 

1.3 

MLS 

0.2 

2.0 

1.1 

1.4 4.4 

0.3 

3.1 

5.8 

1.2 

0.9 

5.9 

1.3 





RIYBR 

2 







MM 

0.7 

2.1 

1.5 

1.5 5.7 

3.5 

0.7 

0,7 

7,7 

0 . 6 

o.a 

0.9 

MLB 

0.7 

2.1 

2.5 

2.6 1.2 

3.5 

0.7 

0.2 

2.2 

0 .6 

0.8 

1.6 

MLS 

1.2 

0.7 

2.5 

0.7 1.3 

1.7 

0.8 

0.2 

4.4 

0.7 

1.4 

1.4 


RIVER 3 


MM 

0,8 

1.6 

0 . 6 

0.7 

5.7 

2.3 

2.0 

5.0 0.7 

2.7 

o 

• 

CM 

7.3 

MLB 

0.4 

1,6 

0,8 

0,5 

5.6 

2.2 

1-5 

8.3'^0.7 

2,7 

1.0 

8.6-^ 

MLS 

0.4 

3.0 

0.8 

0.5 

8 . 4 + 

2.8 

2-0 

8. 3+0. 8 

2.7 

• 

CM 

8.6'^ 


+ Indicates significant values 
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TABLE 3.4 ILiJSlJLTS OP i'BbT 
BATA 


Lumber of Oases Significant at 95^?S Level 


BIS- 

TRI- 

BUTIOB' 


RIVER 


Mo nt h S e ri e s 

12 3 


Ten d a i 1^ S e r i e s 
12 3 


NORMAL (Mi-l) 
(MM) 

LUONORJ'IAL (MLE) 
(MLS) 


4 3 3 

10 0 
0 0 2 

0 0 3 


15 6 11 

3 3 3 

5 13 

2 0^ 


LOG^Bo ARSON TYXE 3 


3 1 


tried by apxjiyin/j; the Hilf er ty-VJilson transformation bo 
logtransfo rmed data standardised with the parameters of MLE. 
There seems to be no significant improvement because of 
Hilf erty-Vlilscn transformation. Hence only logarithmic 
t r an s f o jm at ion w as a d o pt e d in this s tud y , 

3.3 Standardisation of Data 

The parameters of the process, vis^, , mean and standard 
deviation of the monthly and tendaily data exhibit a change 
with season. This is referred to as cyclic nonstationarity , 
which has to be removed in order to moAe the process stationary. 



41 


Standardisation refers to the conversion of the above series 
to a series with z-ero mean and unit stando.rd deviation. Two 
procedures are generally used for this purposes 


i) iVonpara metric standardisations ilonpar ametric standardi- 
sation consists in estimating the seasonal mean and standard 
deviation and and using them to transform the given 
series y(i) to a series x(i) with zero mean and unit stan- 
dard deviation y as follows i 




y(i) - 



(3. 11) 


ii) Parametric standardis ation o Tovjevich (1971) suggests 
the fitting of a harmonic series each for the mean and 
standard deviation s^. Let the harm on if function fitted 
for the mean and standard deviation be and respectively. 

y 

Let 


y(i) - 

u 


(3.12) 


The Z(i) series will not have exactly zero mean and unit 
standard deviation. It is once again standardised to another 
series using the mean Z and standard dovia.tion of the Z(i) 
series. The new series is stationary in mean and standard 
deviation. Generally this procedure requires the estimation 
of less number of parameters than non parametric standardisa- 

(V^vi. TnOTl'- 

tion. In this study, the earlier^proce dure of nonparametric 



standardisation was follow eel to obtain standardised data. 
iSttmdardisation was carzled out sepamtely for monthly and 
tend ally data. us.Lng the parameters estimated by each one 
of the three methods, viz., MA, MLK cuid MLS. The noimali- 
sation and standardisation of streomflow data constitute 
the prelijoina,iy analyses. The normalised standardised data 
are used subse quen. tly in univariate and multivariate time 
series analyses. 



4. mu VARIATE STOCHASTIC MODELS 


4.1 ARIMA i'lodels 

A general introduction to univariate models has 
already been given in Sec. 2.1, Univariate modelling as 
adopted in this study is presented in this chapter. The 
class of nonlinear models Icnom as ARIMA models are often 
used for the represeri'tatiou of univariate time series. If 
the time series is nonstationary, differencing of the series 
is done a number of times if necessary till stationarity is 
achieved. A (p, d, q) AIHMA model is one vfhere the d~th 
differencing results in a model which is autoregressive to 
order p and moving average to order q, viz., it can be 
represented by 

d P ^ ^ 

V x(t) == Z .0^ x{t“i) -h €(t) 4- z e(t~j) (4.1) 

i=l j=.l 

cl 

where v refers to the d-th order differ one in g 5 the first 
group of terms in the ri^t hand side refers to the AR com- 
ponent 5 and the second group refer^s to the moving average 
component. A special case of the ALIMA model is the AltMA 
(p» 1 ) model v/here no differencing is needed. In selecting 
the ARIMA model for univariate analysis, the following 
advantages were noted; 
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(i) availability of a general methodology of selection among 
alternatives, and (ii) availability of an efficient computa- 
tional procedure which will locate the parameters of the ARIMA 
model (Box et al., 1970, McKerohar et al,, 1972, Nelson, 1973)* 
The procedure for ARIPiA and ARMA models are the same except 
that in the former the first step is to determine the order 
of differencing d. Wo differencing was needed for the norma- 
lised standardised data series and hence the methods for 
ARM A model alone are briefly dealt with in the following 
section. 


4.1.1 Identification of the model 


Identification refers to the choice of a particular 
model from the general family that is most appropriate to 
the sample time series and is helped by determining the 
correlation structure. The theoretical autocorrelation 
coef ficient is the ratio of the auto covariance at lag 
i to the variance of the process. In tho present study, 
the time series has already been standardised. So the .sample 
autocorrelation r^^ is equal bo the SEunple auto covariance and 
is given by 


r 


i 


1 

W-i 


W-i 

L 


X 


j+i 


(4.2) 


where K is the number of observations and Xj is the value of 

J 

the random variable at time j. Given the sample data, an 
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estimate ot* the autocorrelation function may be obtained by 
computing with i = 15,2^,.. • The largest possible subs- 
cript i would be W~1 although the computation is often 
carried to considerably less than N-1 lags. A visual 
device which is very useful in identification is the sample 
correlograra, a graph of sample autocorrelations. The object 
in studying the sample autocorrelation is to recognize in 
them a pattern typical of an AIIMA process whose correlograra 
structure is familiar. 

It is to be noted that sample autocorrelations are 
only estimates of the actual autocorrelations for the process 
which has generated the data on hand. The sample Eiutocorre- 
lations are hence subject to sampling errors, particularly 
for the smaAl sizes of data usually available for streamflows. 
Gonse quently sample autocorrelations may somet imes be largely 
in error and so they may not be indicative of the true auto- 
correlations. Hence while interpreting' the sample correlo- 
gram, the genei'al characteristics which are recognizable in 
the sample correlogram' should only bo loolced into and 
smaller details ignored. To distinguish what is significant 
from what is not; a test for statistical significance of 
sample autocorrelations may be made. 3?or a random normal 

series^ the autocorrelations should be normally distributed 

1 

about zero with variance so that all autocorrelations, 
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beyond + 1.96 times the standard errors may be considered 
significant at 95^^ level (Yevjevich, 1972). 

For a moving average proooss, the correlogram exhi- 
bits a cutoff at a certain lag beyond which it is not sig- 
nificant, thus facilitating the deteriaination of the order 
of the moving average process. For autoregressive models, 
the correlogram decays slowly and is not helpful in 
indicating the order of the model. In such cases a set of 
sample statistics, knovm as partial correlation coefficients 
helps in the estimation of the orders of autoregression. 

Consider a pure autoregressive model of order p, viz., 
P 

x(t) = 2 x(t-i) + e(t) (4.3) 

i=l ^ 

where is the AR coefficient of order i and e(t) is the 
residual. These coefficients are related to the autocorre- 
lation coef ficiento by th.e Yule -Walker equations given by 



r'l 

A 

P ■' 

p~l 

■ 

A 

• 

= i ... 

1 • 

k « • • • 

Pp-2 

• • • 

^ (4,4) 

♦ ♦ • 

L^PJ 

1 

• 

CM 

J 

L PJ 


If J^2_9 ••• p are laiown the above system of p 

eq^uations with p unknowns 02 * •••9 solved. 

In practice, the tiue values of p and J? are unknown, let 
p = 1. Using Yule-Walker equations and using the sample 


Sf 
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value ' population value, one gets = 0^^ 
where is the resulting estimate of If vSignifican- 

tly different from zero, it can be concluded that the 
process is at least of order one. To see whether the process 
is of order two or greater, the Yule— V/alker equations are 
solved for p = 2, 



If the resulting estinia.te of 02 <3iff^3rvS significantly from 

zero, it can be concluded that the process is atle ast of 

order two. This procedure is repeated successively for 

larger values of p. If the tiue order of the model is p^, 

then, when the system of equations is solved for p = t 1, 

the value of 0 will not be significantly different from 

p^+i 

zero since it is an estimate of 0^ , -i which is zero. Denoting 

p. -i-X 

by 0x± value of 0j^ implied by the solution for p = i, the 


are referred to as the estimated partial correlation 
coefficients of the process. If the order of autoregression 
is p^, then 

^ii ~ ^ ^ ^^t (4'.6) 


Since 'the partial cor i^elation coefficients are sample statis- 
tics and therefore subject to sampling error, a test is needed 
to decide when 0.. is indistinguishable from zero in a 
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statistical sense. Under the hypothesis that p > the 

Ct' 

approximate standard error ol p.. is gi 'ven by 


SB i 7- p 


(4.7) 


Thus it is inferred that p = p^ if 0^ ^ is small 

compaied to 1. 96/1/" IT. Hence the partial correlation function 
does for the AR rood el what the correlograra does for the 


moving average model. If the process is pure moving average, 
then the partial coireLation coefficients decline in magnitude 
vjith increasing lag and do not exhibit cutoff at any lag, A 
mixed ARMA model is iden'tlfied by a gradual decline in both 
the correlogiram and the partial correlation coefficients. 


In order to test whether the flow series is a realisa 
tion of a serial3-y uncorrelated process, one way is to see 
if the correlogram ordinates are witliin roughly twice the 
standard error (i.e., at 95 confidence level). But the 
standard error underestimates the standard deviation of 
sample auboco rr elation especially at lovr lags. To obviate 
this difficulty, a ' Q’ atatisllc that offers a test on the 
smallness of a whole set of sample autocorrelations for lags 
1 to K has been suggested (Box et al., 1970) viz., 

K 2 

Q = M S r/ (4. a) 

i=l ^ 

The Q statistic is approximately chi square distributed with 
(K-p-q) degrees of freedom. 
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Annual series; The annual data v^ere used to estimate 
the correlogram and partial corre lograin shorn in Figp, 4.1 
and 4.2, They indicate that AE or MA processes are not pre- 
sent and the series may be considered random. The partial 
correlogram had significant values at lags 8 and 10 for 
river 2 and only at lag 10 for river ].. The intermediate 
orders have insignificant coefficients. As the sample size 
is onJ-y 25, the estimates at higher lag may not be reliable. 

The Q statistics in the three cases are 8.8, 12,4 and 10.4, 
and are less than the theoretical chinciuare value of 18.51 
at 10 degrees of freedom at 9!3 perceni'. confidence level. Hence 
the hypothesis of independence of annual flows is accepted 
and the annual flow series may be considered as serially 
independent , 

Monthly series; Correlograms of the s tandeirdi se d 
monthly series of the 3 rivers are slTDvm in Fig. 4.3* They 
show a gradually doclining trend Indicating an AR process. The 
partial autocorrelation coefficients (Fig, 4.4) exhibit a 
s ign if i con t val uc at lag 1 and i ns ig n a.:C i cant valu es a t hi gh or 
lags suggesting a first order AR model. Furfchcrj the seria3, 
independencG of the monthly flows was tested using the Q 
statistic. The statistic had values far higher than the 
permissible value at 95^ level indi eating the rejection of 
the hypothesis of serial indepen denco . So a first order AR 
model seems to be suitable for logtronsf ormod monthly flows. 
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Tendally serie s s The corr el o grains of 'the standardised 
tendaily series shown in Fig* 4.5 exhibited a much slower ’ 
decay than the montiily series and tlio partial correlation 
coefficient (Fig* 4.6) had signif icaiut values at 95 95^ level 
at lags 1 and 3 for rivers 1,2 and 3; and lag 2 also for 
livers 1 and 3. The value for river 2 at lag 2 is nearly 
significant at 95?^ level. The presence of monthly persis- 
tence vras already borne out by a significant lag one coeffi- 
cient in the monthly time series. Tho Q statistic for the 
three rivers were respectively 1110, 1078 and 1080 at 10 
degrees of freedom, being very much greater than the theore- 
tical value of 18.31. Hence the hypothesis ofserial indepen- 
dence was rejected. A third order AR model is hence indicated 


for the -ton cl ally series. It may 
ap ‘TO X ima 1 1 o 11 a ( 1 , 1 ) /ilM A mo d e 1 
But this was not adopted in this 


bo ^po t-jsi ble to fit so an 

f r 0 tfj par;, imo ny o o n Jij 1 d e r a t i u u g . 
study. 


4.1.2 F s t' i mat ion of p ar ajiio tors 


For a pure AR prociess , the estimation of the starting 
values of the 0 coefficients is straightforward and is done 
using the Yule-V/alker equations relating these coefficients 
to the ordinates of the corr elograra . The solution of this 
system of equations supplies the starting estimates. 


The relationship between autocorrelation coefficients 
and parameters is nonlinear for moving average and mixed 
processes and hence the computation of initial estimates 
is more involved. G-enerally tiie following procedure is 
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adopted. Different values are assumed for the parameters 
and the sums of squares of residuals are obtained after 
fitting the model to the data* The sum of squared errors 
is plotted as a function of parameters and the region of 
minimum sum of squared errors is identified, A value in 
this region is used as a starting point. The initial 
estimates axe inserted into an algorithm that is iterative 
in nature and the final estimates ax’s calculated. The 
method of nonlinear least squares regression (Box and 
Jenkins, 1970) is used for this purpose. Program listings 
developed by McKerchar et al . , (1972) were used to find out 
the final estimates of the parameters as well ^ ‘the stan- 
dard errors of estimates of these paxametexs. 


To test the significance of the paxameter estimates 
the following procedure is adopted: The parameter estimates 
are assumed to be Jointly normally distributed with mean 
value equal to the true paxame ter valu es and variance- 
covariance matrix given by (Nelscn, 1973) 




5)2 


SlSl 


'-1 


aB(p+q+l) i 


V(B) = 2a^(u) 


L 


r^'B'Cp+'q+l) bTIT '^-2 


(p+q+1) 




(4.9)’ 
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where B is the vector of the p + q + 1 parameters of 0 and 
3(3 ) is the sum of squared errors 5 a (u) is estimated the 
sample variance and second derivatives of S(b) are numerically 
evaluate d at the pointsof the parameters estimates . The 
square roots of the diagonal eleirents of idle variance- 
covariance matrix give the standard erroi’s SE(B(i)). Under 
the hypothesis that the true parameter values B(i) are zero, 

B( i ) /SB(B( i ) ) is approximately normally distributed so that 
a 95^ confidence limit for B(i) is given by -1,96 3U(B(i)) 
B(l) • 1.96 Sh(B(i)) , 

Parameter estimates for monthly and tendaily flows, 

AR models were identified in Subsec 4«1*1. The initial 
estimates of the parameters were made us ing Yule-Wallcer 
equations, Prom the values of the serial correlation coeffi- 
cients at various lags, the above equations were solved for 
the initial estimtes. V7hen 'these values were inserted in 
the iterative algorithm, the final estima-bes of the parame- 
ters converged rapidly. The convergence was quicker for the 
monthly series than for the tendad.ly series. In fact as 
the process is purely AR, initial estimates by Yule-Wallcer 
equations wore not necessaiy at all fo r c cnvergence and the 
initial estimate of zero yielded a fas'b oonvorgence towards 
the final values. The estinates of the parameters together 
with their standard errois are gi-ven in Table 4.I for the 
monthly flows for a first order AR model and in Table 4*2 for 
tendaily flows for AR models of orders 1, 2 and 3o 
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TABLjii 4*1 PARAJffiTER ESTIMATES EOR MOimiLY SERIES 
(STATIONARY UNIVARIATE AR MODEL) 


RIVER 

ORDER 

OP 

MODEL 

GOBPPI- 

CIBNT 

INITIAL 

ESTIMATE 

PIiTAL 

ESTiriATB 

STAI'fDAT^D 
ERI.’OR OP 
ESTIMATE 

RESIDUAL 

VARIANCE 

1 

1 

H 

0.52 

0.53-76 

0.0495 

0.7330 

2 

1 


0,57 

0.5714 

0.0475 

0.6747 

3 

1 

^1 

0.56 

0.559P 

0.0495 

0.6916 


4.1,3 Validation of the fitted model 

The adequacy of the model is checked thr’ough diagnostic 
checking (Box and Jenkins, 1970) by an exaiaination of the 
char act eristicG of the lesiduEil series. Univariate modelling 
is based on the a^ssuraption that the residual series is serially 
independent, furthermore f or an AR procesFj with a normal mar- 
ginal distribution, the residuals should also be normally 
distributed. Hence the fitted model is ccnsidered adequate 
if each residual series constitutes an independent normally 
distributed series. It should be noted that the Independence 
is necessaiy while nomality is desirable. On the basis of 
the parameter estimates already made, the residual series is 
calculated and tested for normality and independence, 

Normality of Residuals; Ghisquare test was used to 
test the goodness of fit of normal distribution to the 
residual series. The test was done separately for each season. 
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TABLE 4.2 PARAfflTER ESTIHATE FOR TENDAILY SERIES 
(STATIONARY OtfIVARIATB AR MOLBL ) 


RIVER 

MODEL 

GOBFPI- 

craHTS 

IMTIAL 

ESTIMATE 

EIKAL 

ESTIMATE 

STATOARD 
ERTfOR OP 
ESTIMATE 

EESIRUAI. 

VARIANCE 

1 1 


0,62 

0.6219 

0.0261 

0.6174 

2 


0,53 

0.5352 

0.0331 

0.6033 


02 

0.14 

0.1397 

0.0330 


3 

01 

0,32 

0.5240 

0.0333 

0.6002 


02 

0.09 

0.0972 

0.0375 



03 

0.09 

0.0796 

0.0333 


2 1 

01 

0.67 

0.6739 

0.0246 

0,5467 

2 


0.63 

0.6340 

0.0332 

0.5455 


02 

0. 06 

0.0592 

0.0332 


3 

01 

0.62 

0,6265 

0.0333 

0.5371 


02 

-0.03 

-0.0223 

0.0391 



03 

0.14 

0.1285 

0.0331 


3 1 

01 

0.67 

0.6723 

0.0247 

0,5481 

2 


0 . 61 

0.6145 

0.0333 

0,5448 


02 

0.09 

0.0860 

0.0333 


3 

01 

0.60 

0,6054 

0.0332 

0,5392 


02 

0.03 

0.0205 

0.0388 



03 

0.10 

0.1066 

0.0332 
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(i) Monthly series; The results of the chisquare test on 
the residuals are given in Table 4.3. The results of the 
chisquare test for the monthly series are given in Table 3*3. 

TABLh 4.3 ^ ESTIMATES OF UEIVARIATS MOITTHLY RESIDUALS 
(STATIONARY MODEL) 


MONTH 

1 

2 

3 4 

5 

6 

7 

8 

9 : 

10 : 

11 : 

12 




RIVER 

1 








TC2 

>^E3TIMATE 

1.3 

2.1 

2.6 2,6 

19.6 

4.4 

10.5 

1.3 

2,3 

6 , 1 

2,6 

2.5 

DOF 

3 

3 

3 3 

3 

3 

3 

3 

3 

3 

3 

3 

^THEORBTIGAL 

7.3 

7.8 

7.8 7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 




RIVER 

2 








TL^ 

'^ESTIMATE 

1.2 

0.8 

1.4 3.1 

8.2 

4.0 

1.6 

1.5 

0.3 

3.1 

1.0 

4.6 

DOF 

3 

3 

3 3 

3 

3 

3 

3 

3 

3 

3 

3 

2 * 

^’THEORETICAL 

7.8 

7.8 

7.8 7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 

7.8 




RIVFiR 3 








2 

nSSTIMATE 

5.2 

3.4 

2.2 1.3 

14.7 

5.7 

1.8 

7.0 

3.7 

7.8 

4.2 

1.9 

DOF 

4 

4 

4 4 

3 

4 

3 

4 

4 

3 

4 

4 

^THEORETICAL 

9.5 

9.5 

9.5 9.5 

7.8 

9.5 

7.8 

9.5 

9.5 

7.8 

9.5 

9,5 


at 95^ confidence level 
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A comparison of the two indicates that the fit of the noimal 
distribution to the i^sidusls for seasons 5 and 7 for river 
1 and for season 5 for rivers 2 and 3 are not good, Por 
river 1 during season 5 the fit x^as not good for the original 
data as well as for the residuals. The fit v^as good in all 
the other cases. 

(ii) Tend ally series; The results of chisquare test for 
the goodness of fit of a normal distribution for the residueils 
of AR models of orders 1,2 and 3 are summarised in Table 4.4. 

TABLE 4.4 RESULTS OP 96 ^ TEST Oi^f UNIVARIATE 

TEIIUAILY RESIDUALS (STATIONARY MODEL) 


RIVER 

MmiBER OP 

SIUNIPIGANT 

VALUES 

I ORDER 

II ORDER 

III ORDER 

1 

5 

6 

8 

2 

3 

5 

6 

3 

5 

6 

6 


A comparison of Table 4.4 with Table 5*4 (b) indicates that 
the goodness of fit is not satisfied for a much larger 
number of cases in the case of the residuals than in the 
case of the logtrans formed serievS. Purthermore there is 
no consistency with reference to the seasons in that for 
some seasons the fit was good for the logtransf ormed series 
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and not for the residuals and vice versa for some others* 

The detailed results of the chisquare test are not given 
here* They indicated that the goodness of fit was always 
accepted for the monsoon season for the original data as 
well as the lesidusls and was rejected generally for some 
nonmonsoon seasons. They further indicated that the sea- 
sons in which the test for goodness of fit was rejected for 
the residuals clustered around the seasons for which the 
goodness of fit was rejected for the logtransf ormed data. 

Independence of residuals; Univariate modell ing o f 
the data can be regarded as an attempt to find a transfor- 
mation that removes the internal persistence from the data, 
viz., the residuals should he serially uncorrelated. Hence 
the correlogram of the residuals is tes'ted to check whether 
the residuals are serially independent. The tests described 
in Buhsec, 4.1*1 can be used for this purpose. 

The test for independence of residuals can in addi- 
tion be performed in the frequency domain. If the residual 
seriCvS is pure randan, then the power spoctra of the resi- 
duals correspond to white noise spectra. For the 3.atter, 
the spectral density over the frequency range is constant 
and is the average of the computed values of the spectra. 

The spectial density function is estimated by calculating 
the autocorrelation function of the series and then 
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Fourier -t rans fo rm ing it. The sample spectral estimates are 
dislrributed about the population spectrum according to a 
/'p distribution y where P , the number oi degrees of 
freedom is estimated according to 



where m stands for the number of lags used for the estimation 
of correlation coefficients. Prom statistical tables, 

Xi ) and 7:^(1 - ^ given level of 

significance a. From these values, the values of 
are readily determined. If Cr(f) denotes the spectral density 
for pure random process and that for the given sample, 

then Gj^(f)/G-(f) has a distribution and the series is 

pure random at confidence level (1-a) if 





G(f) 




{'Y’9 P ) 


(4.11) 


There are a number of other procedures including nonpar arae trie 
methods for testing independence. They may be obtained from 
reference (Wallis et al,, 1971) • 


Monthly residuals; The lesiduals for the monthly 
series for the first order AR model v/ere tested for aerial 
independence. Treating the residuals as a stationary series, 
the correlograxii was estimo-ted upto a lag of 60 (Fig. 4.7). 
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Assiimirii^^ normality, the hypothesis that the correlogram 
ordinates are not significantly different from ^iero was 
tested at 95^ level. All values vrere lying within the 
95^ confidence limits. In addition the Q statistic was 
also estimated (Table 4.5) and found to he not significant 
thus confirming independence . When the spectra of residuals 
were tested for randomness (Pig. 4.8); they were also found 
to be within the 95^ confidence limits for all the three 
rivers. Hence the monthly residuals for all the three 
rivers can be ccnsidered -to be random series, 

Tendaily residuals; The correlogram of the residuals 
of the first, second and third order models (Pi^ . 4.9? 4*10? 
and 4.11) showed that 6 to 10 percent of the values exceeded 
the 95 confidence limits, and that the third order model 
is better than the first and second order models. V/hen the 
Q statistic was computed (Table 4.5) the relative superio- 
rity of each of the models was noticed. As the order of the 
model increased, the statistic decreased in magnitude. However, 
for the first and second order models, these va.lues were 
higher thai the allowable values, Por 'the third order model, 
the above hypothesis was satisfactory for rivers 2 and 5 and 
for river 1, the statistic was marginally higher than the 
permissible value, A larger lag model may eliminate this 
problem, but was not considered necessary for this study. 



TABL3 4.5 Q-STATISTICS OP UNIVARIATB RBISI DUALS 
(STATIONARY MODELS) 

MONTHLY SERIES 

ORDER OF 


MODEL 


RIVER 1 

RIVER 2 

RIVER 35 

1 

Q 

35.5 

15.8 

33.2 


DOF 

29 

29 

29 


7 2 

95^h 

42.6 

42 . 6 

42.6 



TEMDAILY SFUIES 


ORDE.R 





OF MOD-WD 


RIVER_^1 

RIVER 2 

RIVER^3 

1 

Q 

58.4'^ 

48. a'*’ 

59. 


DOF 

29 

29 

29 


f 2 

95f. 

42.6 

42,6 

42.6 

2 

Q 

52. 

44. J"*" 

54.0"^ 


DOF 

28 

28 

28 


2 

95 % 

41.3 

41.3 

41.3 

3 

Q 

40. s’*" 

27.8 

38.3 


DOF 

27 

27 

27 


/ 2 

95% 

40.1 

40.1 

40.1 


+ denotea significance at the 955?i level. 
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The spec tra of the tend ally re.siduals are plotted in 
Fig. 4.12 along with the confidence hand for a white noise 
spectra and they indicate that the first and second order 
models are not adequate to represent persistence in the 
series and that the third order model is necessary. Hence it 
was decided to adopt the third order model for the tend ally 
series . 

Seasonwise independence of residualss The correlogram 
analysis of the residuals of the stationary Ali model enables 
the testing of the independence of the residuals as a whole, 
i.e., treated as a single stationary series. Whether from 
sea.son to season the residuals satisfy the condition of 
serial independence is not revealed by the correlogram analy- 
sis. The latter can be calculated after grouping the residuals 
according to seasons and estimating the correlation between 
the residuals belonging to a particular season and those 
belonging to the immediately earlier season. This is referred 
to as the 'seasonal serial correlation coefficient'. For 
instance^ considering the monthly residual series, the corre- 
lation coefficient botv/een the residuals of January and the 
previous month, December, may be estimated and tested for 
the signiiicancG of the residual correlation. The hypothesis 
that the population correlation coefficient betv/een two sea- 
sonal residuals is zero is set up. The t statistic is 
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calculated usin^: 

t = (4.12) 

f(i-(rJ)2) 

where N is the number of pairs of the residuals in the sample 
and r^ is the sample correlation coefficient between the j-^th 
and (j-l)'-th seasons. If the calcula-ted t statistic is less 
than the tabular value et the given lov^'l of significance 
for (11-2) degrees of freed cm, the hypothcsivS is accepted. 

Seasonal variation does not arise for the annual 
series. Seasonwiae serial correlation coefficients for the 
residuals of the montHy seriCvS are s.ho^/n in Fig, 4 .13 and 
for the tendaily series in Figs. 4.14, 4.15 and 4.16. It can 
be seen that the coefficient is significant for the monthly 
series only on one occasion for rivers 2 and 3. For the 
tendaily series, the number of such instances are greater. 

For the first order model, the reaiduaJ. s possessed signifi- 
cant correlations between adjacent seasons for 7, 3 and 7 
seasons for rivers 1, 2 and 3 respectively. For the second 
order model, these figures were 9^ 4 and 8 showing an increase 
for all the rivers. For the third order model, rivers 1 and 
3 showed an improvement in the independence of residuals with 
G seasonal values being significant. It was thus seen that 
increasing the order of autoregressive model did not always 
result in an improvement of the se ason-to-season independence 
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ox the residuals. It could also be observed that during 
the monsoon months, the residuals did not generally have 
any significant seasonal coir elation whereas during the 
autumn months, the correlations were very l^irge and signi- 
ficant. The presence of significant correlation from 
season to season in the residuals showed that when examined 
on a season by season basis the model used for the univariate 
analysis was not able to fully account for the intenial 
persistence during all seasons of the year, although when 
examined on an annual basis the model resulted in a residual 
series free from internal correlation. This may be due to 
the averaging the effects of tho several seasons taken as a 
whole. 


4-1.4 Variance of the residuals 

For the univariate model, the total variance of the 
process may be divided into a variance that is accounted for 
by the persistence and a residual variance. The latter may 
be attributed to the component that is serially uncorrelateci , 
After the model is fitted to the data tho variance of the 
residual series can bo estimated thooix-tically or empirically 
from the residuals. 

The data series have already been transformed and 
standardised and hence have unit variance. The variances of 
residuals for the monthly series for the three rivers were 
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respectively 0.7r>30, 0,6747 and 0,6916. For the tendaily 
series, the variances of residuals vie re calculated for the 
first, second and third order models, fhey aie sh.O'\-jn in 
Table 4.2. As the order of the model increases, there is 
a slight reduction in the variance. For river 1, for 
instance, when the order of the model increases from 1 to 
2, the variance decreases from 0.6147 to 0.6003, vis., about 
1.04^ of the original variance, and vdien the order increases 
from 2 to 3, there is a further reduction of 0,31^^ of the 
total variance. For rivers 2 and 3, the overall reduction 
in variance, is 0.96 ^ and 0,89 only, to the third order 
model from the first order model. 

In order to see how much of the persistence component 
was accounted for during each season, estjmation of variance 
was made separately for each season. Residuals belonging 
to each season were grouped together and their variances 
were estimated. Results for tho monthly series are shown in 
Fig. 4.17 and for the tendaily scries (third order model) in 
Fig, 4.18. From season to season, the variances varied in 
magnitude, ThcBe values were seen to be generally higher 
during th e monsoon season than during nonmonsoon season with 
a low in the autumn season. In other words, the persistence 
component appears to vary seasonally and the variance of the 
residuals estirnatod earlier averages out those variations 
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yielding a unique value for the entire scries, for some 
seasons y mostly during the monsoon, the residual variances 
were greater than the original variances. This may be due 
to the estimated value of the AR convponent being higher 
than the a,ctual autoregression prevailing during monsoon. 

The overestimated persistence coraponeiit vrauld have resulted 
in a large negative value for the random component resulting 
in a large variance. 

It is thus seen that when the residuals are treated 
as a single stationary series irrespective of the seasons, 
the variance estimates as well as the corrclograra analysis 
seem to yield consistent resilLts , But if the characteristics 
of residuals arc examined on a seasonal basis, the residuals 
are not seasonally independent, and the variance estimates 
indicate nonstationarity of residuals. So a stationary'' ' 
univariate model is not satisfactory for the monthly and ten-- 
daily series , 

4.2 0 ns t at ion a ry Mo d els 

In the modelling of monthly arid tendaily series, 
differencing was not found to be necessary, furthermore an 
AR component alone is identified and T^IA components arc not 
needed. However, a stationary AR model is found to bo not 
satisfactory. Hence it seems necessary to consider a 
nonstationary AR model. 
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4,2,1 Iden-cif ication of the model 

A genexal nonstationary model may consist of a time 
varying autoregressive component and a tine varying moving 
average component and may be represented as 

x(t) ^ Z x(t-i) + Z E(t-i) + e{t) (4-13) 

i=l ^ i=X ^ 

where and 0^ stand respectively for the autoregressive 
and moving average coefficients of order i for the season 
j, j =1^2^.,, 12 for the montlxLy series and j = 1,2,,.. 36 
for the teiidaily series. The coirelograin cannot be used 
here for the purpose of identification because it is based on 
the assumption of stationarity of the correlation structure. 
However for the purposes of identifying the order and form of 
the models assuming that they do not change from season to 
season, the correlogram maybe, used. Once the form and order 
of the model are identified, they are fitted into the data 
for the entire series. This stationary approach is followed 
only for the purx^oses of identification of the model. No 
parameter estimation for the model is involved. 

In accordance with the lesultEs of Subsoc . 4.1.1? AH 
models of first and third order were used respectively for 
monthly and tendaily series , viz, , q was equal to zero and 
p was independent of j . 
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The monthly series and the tendaHy series are then 
represented respectively by 


(t) = x(t-l) + E (t) , 


(4.14) 


and 


x(t) = x(t-'l) + 0^2 x(t“?) + x(t-3) + e^(t) ( 4 . 15 ) 


The parameteiB are to be estimated xor each season j for 
each river, 


4»2.2 Estimation of parameters 


Yule-Walker relationships p* An extension of the 
Yule-V/alker relationships (Eq, 4.4) is used for the solution 
of AR coefficients of the seasonally nonstationary time 
series. Let be the i-th order seasonal correlation 
coefficient for j-th season, i.e,, it is the correlation 
coefficient between the variable during the j-th season and 
that during the (j-i)“th seascn. Then r^, r^ and repre- 
sent respectively the seasonal correlation coefficient of 
first second and third order for the j-th season. 


for a first order AR. model, i.e, for p = 1, 


= 


(4.16) 


For a second order AR model , i.e., fo r p = 2 , 
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and 

- r^rJ/(l-(r^)2) (4-17) 

For a thiid order AR model with p = 3 (YevJevichj 1972) 
the AR coefficients and are respectively given by 

0^ = [ (l-r^ ) (r^+(i4)^-r^r^ ) J /A and 

*^3 = [ ( )-(l-r^)( r|r^-r| ) J /A (4,18) 

where A = ( l-r^ ) ( l--2( ^ + r^). 

The seasonal correlation coefficients r? aie estimated 
separately for each season j. As the data are already stan- 
dardised, the seasonal correlation coefficients are mvnerically 
equal to the covariances. The i-th order seasonal correla- 
tion coefficient for the j-th season is estimated by 

where is the vector of the variables for the j-th season. 

Method of least squares^ The genei’al p-th order AR 
model can be regarded as a multiple linear regression equa- 
tion, viz., in the equation 

x(t) = 20 ^ x(t-i) + e(t), 
i=l 


(4.20) 
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where the coefficients may be solved by the method of least 
squares. The x(t) series generally has a zero mean and unit 
standard deviation. If standardisation is done using the 
seasonwisG mean and standard deviation by Mh or by the MLlj 
using Chow's frequency factor^ standardisation does not 
result in zero mean and unit standard deviation. Then the 
autoregressive model includes an additional term to repre- 
sent the nonzero mean 5 


x(t) = + 2^ 0^ x(t-i) s(t) 

i=l 

and the coefficients 0^^ i= 0 , 2 ^ . -(t 5 j = 1,2,,.. 

solved by the method of least squares. The solution 
obtained from the following set of normal equations? 


( 4 . 21 ) 


a are 


is 


r 


^ xCt-l) 


g;x( t-i) 2;x'^( t-i) 


^x(t-p) ^2<'x(t) 

sj X ( t -1 ’;x( t-'P ) 0^ , Six ( t ) x( t -1 ) 


• • « 

5;x(t-*j)) •5 ix(t-i)x(t-p) 


• • « « ■ o 


t-p) 


• • fl • « a 

.1 L^r 1 l^('‘:)x(t-p) j 

( 4 . 22 ) 


For the first order model, the AK coefficients as 
obtained by the assumption of Yule-V/alkor relations and by 
the method of least squares yielded almost identical results. 
However, when the order of the model was raised, the Yule- 
Walker equations started giving unstable estimates, more so 
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for the third order model than for the second order model. 
This result was irrespective of whether the standardisation 
was done by MM, MLS or MLS. Hence in tliis study, the method 
of least squares was used for the estimation of the coeffi- 
cients of the ii'odel, Sstimates of the AI?. coefficients using 
standardisation by MLE and MIS are given in Table 4«6 for 
the monthly series. The coefficient estimates for the 
tendaily series for the first, second and third order AH 
models are not given here, 

4.2,3 Validation of the Model 

Normality of the residuals? The residuals were 
determined by substituting the estimated value of parame- 
ters in the model equation* They were grouped in accordance 
with the season to which they belonged* The goodness of fit 
of the normal distribution was verified by estimating the 
chi-square statistic for the residuals of each season. 

(i) Monthly series* Results for goodness of fit at 95 
confidence level are shown in Table 4-7. The fit was not 
good for 2 months out of 12 for river 1 and for rivers 2 
and 3 for only one month whatever be the method of standardi- 
sation, As normality is only desirable and not essential 
the results were considered satisfactory for this study. 
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TABias 4.6 P;iRAMETER ESTIMATES FOR MOOTHLY SERIES 
(HORST ATI Ox'IARY Ul'IIVARI ATE FIRST ORDER 
AR IlODSL ) 


Month 

_ Autoreg, 

RIVER 1 

MLE MLS MLE 

^0 ^1 

ressive 

RIY.7R 2 
Mil 

0 

Ooeff ic ients 

RIVER 5 

3 MLE MLS 

1 

-,30 

.56 

.26 

-.00 

.17 

-.01 

.28 

.90 

,22 

2 

.46 

1.77 

.39 

.58 

1.47 

.56 

.48 

1.53 

.40 

3 

.38 

.94 

,17 

. 64 

"■.46 

.73 

.51 

-.13 

. 53 

4 

.37 

.27 

.37 

.43 

.15 

.43 

.36 

.01 

.35 

5 

,69 

-.34 

.71 

,60 

,02 

.72 

.73 

.31 

.77 

6 

.85 

-.02 

.82 

.80 

-.43 

.77 

.88 

.13 

.89 

7 

.57 

.17 

.58 

.67 

-.07 

.67 

.33 

-.18 

.31 

8 

.17 

-.14 

.17 

.65 

.91 

. 67 

.64 

27 

.70 

9 

.51 

-.14 

.50 

.50 

.12 

.51 

.59 

.19 

.53 

10 

.83 

.13 

.82 

.69 

“ ,46 

,68 

.63 

-.49 

.67 

11 

.78 

-.23 

.79 

.71 

.38 

.70 

.66 

.12 

. 64 

12 

.67 

.59 

.67 

.44 

.43 

,46 

.56 

. 28 

.61 


TABIE 4.7 RESULTS OF TEST ON UNIVARIATE RESIDUALS 
(NONSTATIONAEY M0B:iLS ) 

NUtffiEE OF OCCASIONS OF SI CS'IIFICANT 
ESTIMATES 


RIVER MONTHLY SERIES TBNDAILY SERIES 
ORDER ORDER 

1 

1 2 9 9 8 

2 1 .222 

3 1 4 4 3 
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(ii) Tendaily series: The residuals of the first, second 

and third order models were deteimined for the tendaily 
series of each river. 36 groups of tendaily residual 
series corresponding to each season were identified and 
individually tested for normality. lb was noted that the 
numbers of occasions when fit was not 5 ^ 20 d were same for 
all methods of standardisation of data. As the order of the 
model was increased, from first to second, there was no 
improvement in fit. But, for the third order model, rivers 
1 and 3 showed a marginal improvement. It is likely that 
by using a higher lag model, the goodness of fit may be 
increased. This would involve the estimation of more number 
of coefficients . For a particular length of data, then, 
reliability of the estimates may decrease. Furthermore, as 
stated earlier, normality is not essential. Hence the 
results were considered satisfactory for the model used in 
this study. 

Independence of the residuals: The serial independence 
of the residuals was tested by correlogr£WQ and spectral 
analysis and the results -were satisfactory for both the 
monthly and tendaily series. The seasonal independence of 
the residuals was checked for the monthly and tendaily series 
using the procedures of Subsec 4.1.3. 
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It vas found that for all months and for all the methods of 
standardisation, excejit for one month each for rivers 1 and 
3 using H-'IM, the fit was good (Fig. 4.19). The seasonvrise 
serial correlation coefficients of the residuals of the 
tendaily series for the first, second, and third order 
models are shown in Figs. 4.20 to 4.26 (for MIS only). 

For the first order model, the residuals have significant 
serial correlation coefficients for 3, 2 and 2 seasons 
respectively for rivers 1, 2 and 3. For the higher order 
models, there are no significant seasonal serial correlation 
coefficients except when the method of least squares is used 
for -river 2(2 seasons). This is within 5^ of the total 
number of seasons. Hence it can be concluded that provided 
a second or third order model is used for the tendaily 
series, the residuals are serially and seasonally unco ir elated 
whatever be the method of standardisation. 

4.2,4 Variance of residuals 

(i) Monthly scries J Variances were estimated for the 
seasonal residuals of the first order nio del (Figp. 4.27 to 
4.29). Of the thiee methods used for standardisation, MLS 
appears to give a lower estimate of the variance than the 
other two methods. The other two methods seem compa 2 :able to 
each other. Late autumn, early winter and summer seem to be 
the seasons with minimum unexplained variance and it is during 
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the seasons of monsoon and winter precipitation that it is 
highest. This conclusion seems reasonable and is identical 
for all the three methods of standardisation. 

(ii) Tendaily series s The variances of the residuals were 
obtained for the firsts second and third order models using 
MMy MLE and MLS (Figs. 4.30 to 4.38). As in the case of 
monthly series,, these variances were lowest during late 
autumn and early winter andhighest during the monsoon 
season. Also, it was noted that as the order of the model 
was increased from first to third, at each step there was 
some reduction in variance. Among the three methods of 
standardisation used, MLS again returned a lower residual 
variance on larger number of occasions than MM and MLE, For 
instance, for the third order model, when the coefficients 
were estimated by MLS there were less residual variances 
in 20 out of 36 seasons than when estimation was done by MM 
or MLE. It was also observed that MLS gave less variance 
more often during the monsoon months and spring. In autumn 
and winter, MM appeared to give less variance. A disturbing 
feature of MM was that, during some seasons, the residual 
variances were higher than the original veiriances . 
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4*5 Conclusion 

In selectitig a univariate model for the logarithms 
of strearaflows at each site, the follov/ing points may be 
noted : 

(i) The residuals of the stationary model display signifi- 
cant seasonwise serial con-elation. Increasing the order of the 
model does not show improvement in serial independence. On 

the other hand, the residuals of the nonstationary model are 
serially independent. This is because the persistence struc- 
ture of the fonner is represented by unique autoregressive 
coefficient for the whole year, whereas the second model 
takes into consideration the time varying nature of the 
correlation structure. This no doubt inci^ases the number of 
coefficients to be estimated. 

(ii) Because of their stationary nature, the residuals of the 
stationary model have a unique value of the ^rianco for the 
entire year. If the variances are estinated separately for 
each season using the above residuals, the residual variances 
are sometimes higher than the original variance. Por the 
nons tationary model in which the parameters are estimated on 
a seasonal basis such a problem does not arise, 

(iii) For the stationaiy models, the entire length of the 
historical data, after due transformation smd standardisation, 
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is -treated as a single sample and the coefficients are esti- 
mated. The nonstationary analysis divides the total sample 
into s omples of smaller length and the coefficients are 
estimated from such small samples, Reduction in sample size 
results in a reduced reliability in the latter case. Like-- 
wiSGy when the residuals are checked for independence^ the 
correlogramof residuals in a stationary model treats the 
entire data length as a single sample while the nonstationary 
model tests the correlation coefficients of residuals sea- 
sonally, utilising samples of shorter length. For the monthly 
series, the reduction is from 12 to 1 and for the tendaily 
series from 36 to 1. 

(iv) The decoupling of the multivariate model implies that 
the univariate residuals should as far as possible he pure 
random. The uso of a stationa.iy AR model results in a 
nonstationary series of residuals with a soasonall^^ varying 
correlation structure. On the other hand> the uso of the 
nonstationary AR model results in a stationary residual 
scries. Hence for the data considered, it is necessary to 
use a nonstationary AR model for representing the univariate 
process . 

On the basis of the above considerations the following 
univariate models are adopted for further study; The annual 
series can be taken as serially uncorrelated. As such, a 
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zeroeth order univariate model seems suitable to represent 
this series. A first order nons tationaiy AR model is satis- 
factory for the univariate modelling; of the monthly data. A 
third order nonsts tionaiy AR model is satisfactory for the 
univariate modelling of the tendaily d.rf.ta. For river 1, — . 
however, the representation seems to be not satisfactory 
with reference to the probability distribution. As normality 
is not an essential criterion, a none tationary third order 
AR model is chosen for all the three rive is. 
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MULTIVARIATE; MOLEiLLIETG 


5.1 Decoupled Multivariate Stochastic Models 

The univariate residual series e(t) are determined 
from the nonstationary model 

P -i 

x(t) = 2 : x(t-i) + e(t), j = 1,2,... s (5.1) 

i=l 

where e(t) is a serially independent random variable with 

1 2 

mean zero and variance (s“) , and s is the number of seasons 
within an year. Vihen the number of stations is greater 
than one, the model can be represented as 

P 

Xj^(t) = Y. t~i) + k 1,2,... iC and 

i=l 

j = 1 , 2 ,... s (5.2) 

where Si-Ct) is a serially independent rendom variable at time 
t with mean zero and variance (s^ in season 3 for the Ic-th 
site and K is the- total number of sites. 

The set of serially uncorrelated series v/ith 

k = 1 ,.,. K constitutes the multivariate time series in this 

study. Although decoupling of the univariate model has 

removed the internal dependence of each of the component 

time series, external dependence may still be present among 

the time series. Knowing the standard deviation s^ , the 

^k 

set of series is standardised to an series 
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with zero mean and unit vStandard deviation. Hence for the 
nonstationary univariate residuals 

T 


{E(t)} = ^^(t), ©^(t), 

El(t) £2^“^) 

j f »»»< 

^2 


J 


t) 


T 


(5.3) 


^1 "-2 ^IC 

{E(t)] constitutes the multivariate series of serially 
independent components , It may he represented hy the multi- 
variate AREA, model of order (p, q) as followss 

■^E(t)] = [0^] ■[E(t~u)^ + [D^] ^p(t'hl-v)j , 


3 1^2p4o»s 


(5.4) 


Earlier studies in multivariate modelling'Of streamflows 
(Eat alas > 1967? Young and Pisano? 1968) have indicated that a 
multivariate first order autoregressive model may ho suffi- 
cient for the representation of multisite streamf lows. Henoo 
in this study it vras assumed that u = 1 and v = 1 so that the 
above equation reduces to the nonstationary multivariate 
first order autoregressive model. 

{E(t)]- = [ch {E(t--l)^ + [D^] {n(t)} (5.5) 

Assumption of stationarity results in [C^] and [D^] being 
independent of j , so that 

{l3(t)'^ = [C] {E(t-l)]: + [D] {Ti(-fc)} 


(5.6) 
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which is of the same form as proposed by Matalas (1967). 

For the nonstat ionaiy model, the coefficient matrices 
[C^] and [D^] may be evaluated in a manner analogous to the 
procedures of Matalas (1967) aud Younj^.: aud Pisano (1968), 
Letting the lagzero and la gone correlation matrices among 
the '^^l(t)^ series for the j-th season to be represented by 
[M^] and respec tiveily , the coefficients of the model 

may be estimated by 

[ob = [wbl [Mb"^ and (5.7) 

[DbCD^]'^' = [mJJ - [ob[Mi]'^ (5.8) 

Assuming a lower triangular form for the matrix [D^], a 
recursive method of obtaining a unique solution for [D^] was 
presented by Young and Pisano (1968) for the case j = 1. The 
same procedure ms used in this study for estimating the 
coefficients. In case the data are none oncurrent and [M^] 
is not positive definite, the procedures su^ested by Fiering 
(1968). Crosby and Maddock (1970) and ;Fuller (1974 ) niay be 
used to adjust the elements of the correlation matrix in 
order to make them consistent. They vrere not lequired in 
this study for the stationa,ry model as concurrent data were 


used . 
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5.2 Decoupled Multivariate Stationary Model 

5.2.1 Estimation of parameters 

The annual series arc serially independent and pure 
random. So they constitute the s tan da. rdt sed multivariate 
data. Three different univariate models have been fitted 
to the monthly and tencfe^ily series usin^’ MM, MLE and MLS 
for parameter estimation. The univariate standardised 
residuals corresponding to the three methods were determined 
for fitting multivariate models. V/hen the results for the 
three methods of standardisation are comparable, the results 
for only one method are presented. In the case of all the 
series, a stationary multivariate first order AK model was 
initially assumed. The coefficient matrices [C] and [D] were 
estimated us ing the procedure in Sec. 5.1. 

5.2.2 Significance tests on parameters 

Mat alas and Vfallis (1972) have pointed out that [M^] 
Tt 

and [DD J should bo positive definite in order that the 
multivariate first order AR model may bo fitted to any data 
series . Let K*]l or define the determinant of [•] , 

Thus even if 0, it should be separately checked if 

; m I 

I DD j 7“ 0, It was found that for stationary multivariate 
models, both the above requirements were met with for annual, 
monthly aiid tendaily series. 
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[G] matrix is a measure of the lag one cross correla- 
tion among the •[l;(t)y series. Whether it is significantly 
different fron zero may be tested by a multidimensional 
test of significance due to Anderson (1958). Let 


P = ] M_ 1 an d (5.9) 

0 I 0 i 

h = l”o 1“ (5.10) 

Lefine a statistic Z given by 


P — P 

.7 _ 0., 1 H - m 

^ • m 

0 


(5.11) 


where IT is the size of the sample , m is the order of the 
matrix and m = K. Z is approximately distributed as chi- 
square v/ith m degrees of freedom. If the estimated Z is 
less than the theoretical value, at, say, 95^ level, then 
the estimated Z statistic and hence [G] are not statistically 
significant at that level. 


In physical systems, there may be physical reasons to 
justify the presence of serial or cross coir elation even though 
they may not be statistically significant. It is the general 
practice to take into account these correlations when they 
explain more than, say, 5^ of the variance of the series 
even if the coefficients are not statistically significant. 

A comparison of (Bq. 5.10) with [LL^] (Bq. 5.8) indicates 
that Pq is a measure of the multivariate variance while is 



118 


a measure of the residual multivariate variance. Since the 
rivers considered are tributaries to the same major river and 
they lie adjacent to each other in an essentially homogeneous 
hydrometeorologica.! region, a correlation seems meaningful 
even when they are not statistically significant provided 
they explain more than 5?^ of the variance. 

Annual series 2 The results of the analyses for the 
annual series are given in Table 5.1- J 

both positive definite and the Z statistic is not significant 
at 95^ level. However, the variance explained by [O] is 
of the order of 54^ of the total variance and appears to be 
significant. Hence [D] seems to be physically meaningful even 
though it is not statistically significant and so it is 
retained in the model. 

Monthly series j The results of the analyses for the 
monthly series are given in Table 5*2. [ DD J are 

both positive definite in all the cases. The values of Z 
statistic are not significant at 95^ level implying thereby 
that the [C] matrices are not statistically significant at 
that level. The variances explained by the [0] matrices are 
of the order of 4 ^ of the total variance and hence the 
[0] matrices are considered to be not physically significant. 
Hence the monthly series can be represonted by a multivariate 
model of zeroeth order, viz., 
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TABLiii 5.1 PARiJlETER ESTIMATES FOR STATIONARY 
OTLTI VARIATE MODEL (ANNUAL SERIES) 


-.62 

-.43 

. 95 ] 

j 


r.89 


-.44 

-.42 

. 64 1 

[DJ 

CNJ 

• 

II 

.70 

-*.28 

-.53 

.60J 


u- 74 

.39 

^E - 

4 .OO 5 ^t,95^ 

7.82 

1 DD^ I = 

.07 


F 

o 




0.5 


TABLE 5.2 PARAMETER ESTIMATES FOR STATIONARY FIRST ORDER 
MULTIVARIATE MODEL (MONTHLY SERIES) 


MM 

MIE 


MLS 


>■.03 -.14 . 03 I 

1 1 

1 03 - . 10 

-.02'! 

T-os -.13 

.O^i 

I 

= ;.13 -.06 -.02 .• 

: 1 

.14 -.05 

1 

-.01 1 

!.12 -.06 
i 

-.01! 

1 

|_.16 -.08 -.03j' 

.15 -.08 

•-.03J 

L.17 -.10 

-. 02 J 

= 3.96; 

^E ~ 4“ • 

f 

2-E = 3.88j 



^T,955i " 

7.32 

^T,95f» ^ 

82 

r.99 

i ’ 

.00 

1 

r.99 

1 

n 

1 

=^1.58 .81 
! 

1 

.62 .78 

I 

1 

1.58 .81 

i 

|_.42 .37 .82^ 

.49 .32 

.aij 

j_.35 .31 

.88J 

T { 

DD'^I = .42 

1 dd'’^ 1 = 

40 

[db^I = . 

50 


Fo~F 


1 



0.04 
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{B(t)} = [D] (5.12} 

This result is similar to that of Yevjevich for the net 
basin supplies to the G-reat lahes of North America (1975). 
Assuming [D] to be lower diagonal, the elements of the 
[ Dj matrix are e stima te d f rom the equation 

[DD®] = [Mq] (5.13) 

The elements of the [ P] matrix estimated from Bqs* 5.12 
and 5.15 are given in Table 5.2(a). A comparison of 
Tables 5.2 and 5* 2(a) indicates that the coefficients of 
the [ D] matrix have increased only very slightly from the 
first order model to the zeroth order model. This confirms 
the adequacy of the zeroeth order model. 

TABLE 5.2(a) PARAMETBP. ESTIMATES FOR STATIONARY 2ER0ETH 
ORDER MULTIVARIATE MODEL (MONTPILY SERIES) 



1-'' 

• 

o 

o 

1 


IffiE 

'i.oo 1 ri.oo 

; j 

MLS 

R 

i 

[L] = 

.59 

C\J 

00 


.64 .79 j j .59 

.81 



L. .44 

CO 

• 

CO 

• 


.51 .34 .83 j L .37 

.33 

.89j 


1 T 

|dd-^ 

I = .45 


/dN| = .43 |dd^ 

1 = . 

52 


Tendaily 

serie s : 

The results of the analyses for the 


tendaily 

series 

are given 

in 

Table 5.3. [M^] and [DD^] 

are 



positive definite in all cases and the Z statistic is not 
significant at 95^ level. The variances explained by [C] 
matrices are of the order of of the total variance. 
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TABLE 5.3 PARAi'lETER ESTIMATES FOR STATIONARY FIRST ORDER 
MULTIVARIATE MODEL (TBNDAILY SERIES) 


[G] 


[D] 




MM 




MLE 



MLS 

, 02 ' 

r- 

-.03 

. 06 « 

•. 02 " 

r- 

.03 

, 06 

-. 02 '^ 

r_.o4 

1 

.07 

= 

.06 

-.01 - 

.06 


.06 

.00 

— , 06 

i .05 - 

j 

,01 

.07 

L 

.01 

.11 - 

.04„ 

L 

.01 

.11 

~. 0 ^ 

L -01 

.10 

.05 



5.53 




5. 

40 


5.93 




7.82 



^ - 

7. 

82 

"t = 

7.82 


I 

.,00 


“1 

fl 

.00 


n 

h.oo 

r 



= 

.56 

.83 

j 

; 

o 56 

.83 

\ 

’ .54 

t 

.84 


L 

.43 

.27 

CO 

u 

.43 

.27 

.861 

-j 

i .43 

L 

.26 

,86 


Idd^ 

\ 

i = . 

50 


' T 
(DD-^ 

1 = 

.50 


! = .52 



^ 0-^1 





0.02 


Zfjy is the theoretical value at 
95^ CL with K Degrees of freedom. 


Hence the [C] matrices are considered to he statistically and 
physically not significant. The ten daily series can hence he 
represented hy the multivariate zeroeth order AR model. The 
elements of [D] estimated from Eqs. 5.12 and 5.13 are given 
in Table 5.3(a). A comparison of the results of Tables 5.3 
and 5.3(9') indicates that the corresponding elements differ 
only very slightly thus confirming the adequacy of the zeroeth 
order model. 
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TABIiE 5.3(a) PAiUMBTER ESTIMATES FOR STATIONARY ZEROSTH 
ORDER lyOJlTIVARIATE MODEL (TBNDAILY SERIES) 


ri.oo 

1 

ri.oo 

1 

! h.oo 

i 1 


1 

.57 

.83 ' 

t [ 

1 

! -57 

.83 

i 

.85 

. 

L .44 

,28 .87j 

L .44 

. 28 .87j 

L -44 

.27 

.87j 


= ,51 


Idd^I = .51 

Idd^I 


.55 


5.2,3 Testing of residuals 

The multivariate residuals were detertnined from 

the multivariate time series and the fitted models. They were 
then tested for normality by the chisquare testj independence 
within the time series by correlogram and spectral analyses; 
seasonwise independence within Ihe time series by the seasonwise 
first serial correlation coefficient; independence among the 
time series by cross correlogram, and cdierence spectra; and 
seasonwise independence aroong the time series by the season- 
wise lagzero cross correlation coefficients. 

Normalitys The multivariate residuals for the annual 
series we ire tested for normality by the chisquare test 
(Table 5.4) and were accepted at 95^ level. The results for 
the chisquare tests on the monthly series are given in 
Table 5.5. A comparison of these with those forthe univariate 
monthly residuals (Table 4.7) indicates that the results are 
comparable for MM and MUB and are better for the multivariate 
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TABLE 5.4 j3 ESTHETES OF MULTIVARIATE ABEUAL RESIDUALS 
RIVER 12 3 


1-^ 

'^ESTIMATE 

,2 

1.0 

.7 

VC 2 

theoretical'' 

7.8 

• 

C'J 

7.8 

+ Degrees of 

freedom = 

3 5 Sigilificance level ~ 95 


TABLE 5.5 ESTIMATES OR MULTIVARIATE MONTHLY 

ItESIDUALS (STATIONARY ZEROETH ORDER MODEL) 


NUMBER OP MONTHS \VHEN HYPOTHESIS IS RKJEG'TED 
AT 95/o LEVEL 


RIVER 


MLE MLS 


12 2 1 
2 111 


0 


residuals for MLS. The results of the chisquare test on 
the multivariate tend ally residual series are given in 
Table 5.6. A comparison with the results for univariate 
residuals (Table 4.7) indicates a better fit for station 2 
and a comparable fit for stations 1 and 3 in the case of 
MM and MLE and a comparable fit in the case of MLS. The 
goodness of fit is hence considered satisfactory in the 
case of annual monthly and tendaily series. 


124 


Independence within the time series s The correxo- 
gram and power spectra of each of the time series in the 
multivariate residuals ■^'n(t)^ are determined by standard 
procedures and tested for serial independence (tSubsec. 4*1.3). 
The correlograms and power spectra for the annual series are 
shov/n along with the 95^ confidence limits for the vfhite 
noise series in Fig. 5.1(a) and Fig, 5.1(b), They indicate 
that the series are not significantly different from pure 
random series at 95/^ level. The correlograms and power 
spectra of the multivariate residuals for the monthly and 
tendaily series (MLS only) are sho^m in Figs. 5.2(a) and 
5.2(b) and 5.3(a) and 5.3(b) respectively. The residuals 
are not significantly different from the white noise series 
at 95/^ level. 

Seasonwise independence within the time series s The 
first serial correlation coefficients between consecutive 
seasons are estimated for each of the monthly and tendaily 
series and tested for significance. The results (for MLS 
only) are shown in Fig. 5.4(a) for the monthly series 
and Figure 5.4 (b) for the tendaily series. They 

indicate that the seasonwise serial correlation coefficients 
are statistically not significant at 959^ level. 

Independence among the time series s In order to 
test if the ^Ti(t)^ residuals for each of the time series 
were mutually independent, the cross corrologram between 
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the series taking two series at a tL'fie were estimated. 

They were tested for their significance at 95^ level. 

Usiag the coherence spectra between any two time series, the 
independence among the time series wa-s tested in the fre- 
quency domain. The significance of these spectra was ascer- 
tained by the procedure due to Granger and Hatanka (1964-). 

If the sample size is N and the cross spectrum is estimated 
over ra frequency bands, then the distribution of coherence 
when the true coherence is zero at the frequency f is given 
by 

P(f) = 1 - (1 - f2)W/m (5.14) 

They have also provided a table giving the least significant 
values of the estimated coherence spectra for various values 
of W/m. This was used in the present study to determine 
if the estimated values were significant at 95?^ level, 
Furthermore, for the monthly and ten daily multivariate 
residual series t , the lagzero crosscorrelation 

coefficient for each season was tested for significance at 
95^ confidence level* 

The cross correlogram and the coherence spectra for 
the annual series taken in pairs are shovm in Fig. 5* 5(a) 
and Fig* 5.5(b) respectively, along with the 95^ confi- 
dence limits. The crosscorrelation and the coherence 
between the time series are found to be not significantly 
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different from zero. The cross correlogram and the coherence 
spectra for the monthly series are shov/n in Fig . 5.5(c) and 
Pig. 5.5(d) respectively along with their 95 confidence 
liifiits. They indicate that the '{'n('t)j[ series may be consi- 
dered to be a pure random time series. The seasonwise cross- 
correlation coefficiei ts between any two maithly residual 
series are plotted in Fig. 5.6, Between stations 1 and 2 
only MLE indicates a month with significant cro sscorr elation , 
Between stations 1 and 3, all the three methods indicate one 
month with significant c ross correlation among the residua3.s. 

Between rivers 2 and 3, MM and MIS indicate t^o months 
with significant crosscorrelation , The cross correlograms 
betv/een'v.the tenda-ily series of stations 1 and 2 , 

1 and 3 and 2 and 3 shown in Fig. 5.7. The coherence 
spectra are shown in Fig. 5*8. They indicate that these 
series can be regarded as mutually independent at 95^ Cl, 
Seasonwise c ross correlation coefficients for the three methods 
of standardisation viz., M, MIE and MIS are shown in Figs. 

5.9(a), (b) and (c). IM seems to yield residuals with the 
least number of significant seasonal crosscorrelation coeffi- 
cients , closely followed by MLS, The number of seasons when 
these coefficients are signif icantly different from zero are 
4,7 and 4 between stations 1 and 2; 4, 4 and 4 between stations 
1 and 5^ and 7, 10 and 9 between stations 2 and 3, for MM, 

MLB and MLS respectively. Hence the ^'.i(t)|’ series retain 
significant cross correlation in atleast 11 to 20 of the seasons. 
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TABLE 5.6 ESTIMATES OE MBLTIVAKIATE TSiTDAILY 

RESIDUALS (STATIONARY ZEROETH ORDER MODEL) 

NUr^BER OF SEASONS V/l-IEN PiYDOTHESIS IS REJECTED 
AT 95^ LEVEL 


RIVER 


STANDARDISATION BY 
MM MLS MLS 


1 9 9 7 

2 112 

3 3 3 4 


5*2.4 Discussion of results for stationaiy multivariate models 

The results of fitting a stationa3:y multivariate model 
for the annual, montlily and tend ally series indicate the 
following; 

i) The stationaiy multivariate first order AR model 
represents the annual series adequately and the residuals 
constitute independent random series, and there is no signi- 
ficant coherence between the residuals* 

ii) The monthly and tendaily series can he represented 
hy a multivariate zeroeth order AR model with serially 
independent residuals. This agrees with the ccnclusion 

of Yevjevich (1975) for the net basin supply to the Great 
Lakes of North America. But there is a significant cross- 
correlation hetvj-een the residuals for one or two months 
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out of 12 in the case of monthly series, and a significant 
number of seasons for the tendaily series. 

iii) In case the sigaificant seasonal crosscorrelation 
between the residua3. series is ignored, the stationaiy multi- 
variate AR model can be considered to be Q- satisfactory repre- 
sentation of the' univa.riate residuals in terms of independent 
randan components. 

5.3 Decoupled Ronstationaiy Multivariate Models 

A stationary multivariate model is found to be 
satisfactory for the annual time series. Stationary multi- 
variate models for monthly and tendaily series indicate 
significant seasonwise crosscoirelation between the multi- 
variate residual series. It hence seems necessary to consider 
nonstationary multivariate models for the process. It is 
proposed to use a first order AR model with seasonally vari- 
able [Cj and [D] matrices. 

5.3.1 Esthnation of parameters 

The standardised univariate residuals E(t) of the 
monthly and tendaily series, corresponding to MM, MLB and MLS 
were used in the multivariate modelling of the time series. 

The coefficient matrices [C^] and [D^] where J is the 
seasonal index were estimated using the procedure in Sec. 5.1. 
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It may be noted that the sample size for estimation of 
parameters of the nons tationary model is N/s as against N 
for the stationary model. The estimates of the coefficient 
matrices for monthly residuals are given in Table 5.7. For 
tendaily series, it was not possible to fit a nonstationary 
multivariate AR model for 4 seasons out . of 36, viz., during 
seasons with serial numbers 11, 24, 30 and 36. This was 
because, during these seasons, [D'^] was not positive 

definite iraplying that the variance explained by the [0^] 
matrix was larger than the original vr-: riance and hence the 
[L^^^had to explain a negative variance. This is not possible 
as the elements of [D^] are assumed to be real quantities . 
Matalas and Wallis (1971) have indicated that under this 
conditions, the Young and Pisano model cannotbo used to 
represent the data. It may be possible to use other models 
or else to use procedures suggested by Fiering (1968), 

Crosby and Maddock (1970) etc,, in the case of inconsistent 
matrices. This was considered to be outside the scope of 
this study. Hence whenever such an inconsistency was met 
with, the simpler zeroeth order multivariate model (Eq,5.12) 
which corresponds to that of Yevjevich (1975) was used. This 
will be valid as [M^] is positive definite. 

The [O^] matrices obtained from the nonstationaiy 
model can be considered as realisations of a stationary [0] 
matrix. It is desirable to test whether the [C^] matrices 
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TABLE 5.7 PARAI-ISTER ESTIPIATES EOR i'TOHSTATIONARY EIR3T 
ORDER MULTIVARIATE MODEL (MONTHLY SERIES) 

^THEORETICAL, =5’^ ^m.ORETXG AL, 


MONTH 


STANDARDISATION BY 




MM 


MLE 



MLS 


1 

-.13 

-.45 ,31 

,01 

-.21 

.05 

-.07 

-.38 

^.20 


.05 

-.05 -,03 

-.24 

.00 

.12 

.07 

-.01 

-.09 


-.09 

-.53 -.39 

-.37 

-.46 

.53 

-.02 

-.38 

.19 


2]3 = 

1.7 


1.5 



1,8 



= 

0.4 


0.4 



0,5 


2 

.13 

-.22 ,18 

,01 

- .22 

.20 

.20 

-.13 

-.01 


.35 

-.20 -.09 

-.04 

-.01 

.02 

-.31 

-.11 

-.21 


-.14 

-.05 .21 

.28 

.03 

.19 

-.01 

-.04 

.12 


2e 

= 3.3 


4.2 



1.0 




= 0,8 


1.1 



0.3 


3 

-.16 

.39 -.23 

-,35 

.25 

06 

-.06 

.28 

-.16 


-.41 

-,07 .14 

-.54 

.20 

.04 

-.33 

-.04 

-.21 


-.12 

.42 -.38 

-.28 

.53 

-.40 

-.02 

-.16 

-.11 



= 8.4+ 


8.8+ 



3.7 



•*-E 

= 2.1 


2.2 



0.9 



contd . , . « 
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MONTH 


STANDARDISATION BY 



MM 



MLE 



MLS 


4 

-.05 

.44 

-.49 

-.45 

.96 - 

.57 

.02 

-.03 

-.01 


-.15 

1.03 

-.88 

-.31 

1.19 - 

.90 

-.09 

.42 

25 


.13 

1.00 

-.98 

.21 

.87 - 

.93 

-.04 

.55 

-.44 



= 9 , 8 + 



10 . 4 ^ 



1.8 




= 2.5 



2.6 



0.5 


5 

-.26 

.13 

.25 

-.63 

.08 

.55 

-.27 

. 13 ' 

. 28 


.11 

-.36 

.33 

-.41 

-.08 

.46 

.11 

-.39 

.42 


.31 

-.41 

,26 

-.20 

-.12 

.27 

.28 

-.16 

.09 



= 5.5 



6.2 



3.7 




= 1.8 



1.6 



0.9 


6 

-.12 

-.23 

-.05 

-.12 

-.15 “ 

.14 

-.16 

-.22 

-.04 


.20 

-.35 

-.48 

-.12 

.23 - 

.37 

.15 

-.36 

-.39 


.35 

-.41 

28 

-.07 

-.24 - 

.15 

.32 

-.36 

-.27 



- 7.2 



8 . 3 ’^ 



7.0 




= 1.8 



2.1 



1.8 


7 

. 60 

26 

-.20 

.69 

-.33 - 

.29 

,61 

-.27 

-.23 


.86 

-.40 

16 

.98 

-. 49 ' - 

,18 

.88 

-,40 

-.24 


..52 

- 1.0 

! 

• 

CD 

* 62 

-.17 - 

.40 

.51 

-.20 

-.33 



= 9 . 0 + 



9 - 2 '^ 



8 , 6 '^ 




= 2.3 



2.5 



2,2 



Contd , . . 
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STMCAHDISATIOIT BY 


MONTH 




MM 



MLE 



MIS 


8 

.24 

-.05 - 

.07 

.22 

.01 - 

.03 

.25 

-.03 

-.10 


-.40 

.45 

.25 

^.51 

.54 

.31 

-.57 

-.44 

. 21 


.16 

-.56 

.25 

.23 

-.42 

.21 

. 20 

-.45 

.24 



= 8 . 5 + 



10 . 4 + 



8 , 0 + 




= 2.1 



2,6 



2.0 


9 

.36 

-.80 

.61 

.24 

-.68 

.51 

.28 

-.70 

.49 


.67 

-.72 

.56 

.54 

-.60 

.55 

.61 

-.53 

.38 


.05 

-.03 - 

.07 

.08 

-.04 - 

.07 

-.01 

-.07 

-.02 



= 14 . 5 "^ 



16 . 2 + 



13 . 7 + 




= 5.6 



4.1 



3.9 


10 

.32 

-.11 - 

.66 

.44 

-.31 - 

. 62 

.30 

-.13 

-.60 


.25 

.05 - 

.52 

.46 

-.10 - 

.37 

.25 

,00 

-.27 


.49 

-.08 - 

.30 

.66 

22 - 

. 55 

.49 

-.09 

-.30 


Zt-:^ 

= 6,0 



6.0 



5.5 



1 ''J 










E - 

E 

- 1.5 



1.5 



1.4 


11 

.05 

-.63 

.46 

.16 

-.03 

.19 

.17 

-.81 

.63 


.49 

-.49 

.52 

.18 

-.17 

,55 

* 60 

-.67 

. 68 


.20 

, 26 

.00 

.14 

.37 - 

■.05 

.21 

.19 

.02 



= 10 . 1 ^ 



11 . 5 '^ 



9 . 0 + 



% 

= 2,5 



2,9 



2.3 



Oontd . . . 
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MONTH 


STMDARDISATIOi) BY 



MM 


MLB 



MLS 


12 .53 

-.24 '.31 

-.48 

. 66 - 

.34 

. 28 

-.31 -. 

18 

-.25 

.52 .25 

-.39 

.56 - 

.17 

28 

.52 -. 

22 

-.19 

-.05 .33 

- . 46 

.07 

.42 

-.08 

. 26 

04 


= 11.8*^ 


12.7'^ 



11.0+ 



= 3.0 


3. 2 



2.8 



+ indicates significant values 


[D] MATRIX 


MONTH 


M 


STMIIAEDISATXO 

MIB 

J BY 

MLS 


IS [dd'^'] 
PD ? 

1 

.94 



.98 



.95 





.39 

.92 


.56 

• 81 


.59 

.92 


YBS 


.41 

.58 

.59 

. 61 

.37 

.53 

.40 

.54 

.68 


2 

.99 



.99 



-99 





.73 

. 64 


. 76 

.64 


.73 

.65 


YBS 


.66 

.38 

.63 

.65 

.32 

.66 

• 61 

.26 

.74 


3 

.96 



.96 



.96 





.50 

.78 


,61 

. 68 


.48 

.77 


YES 


.56 

.53 

.55 

.58 

.43 

.57 

-15 

.01 

.97 



Contd . , . 


MONTH 


STMDARDISATION BY 


IS [DBT] 

MT-'I MLE MLS 


4 

.97 



.94 



1.0 





. 64 

.61 


.75 

.47 


.73 

. 64 


YES 


.45 

.52 

.44 

.57 

.42 

.41 

.51 

.53 

.65 


5 

.93 



.92 



.92 





,48 

.84 


.74 

.59 


.45 

.83 


YES 


.29 

. 60 

. 68 

.75 

.16 

.61 

.27 

.57 

.74 


6 

.95 



.96 



.95 





.33 

.78 


.24 

.82 


.57 

.80 


YES 


.25 

-.35 

.79 

.39 

-.29 

.81 

.24 

-.36 

,81 


7 

.88 



.84 



.87 





.31 

.67 


. 26 

. 60 


.29 

. 6 6 


YES 


.25 

.14 

.83 

.19 

. 06 

.82 

.22 

.01 

.87 


8 

.98 



.98 



-98 





.73 

.57 


.77 

.50 


.73 

.58 


YES 


.40 

-.10 

CO 

• 

.48 

-.24 

.76 

.38 

-.07 ' 

' .83 


9 

.79 



.86 



.83 





.50 

.55 


.50 

, 60 


.58 

. 56 


YES 


.59 

.56 

.58 

.52 

.56 

. 64 

.56 

.35 

.74 



Gontd. . , 
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MONTH 




STAI'JDARDISATIOH 

MLE 

BY 

IttS 


IS [DDT] 
PD? 

10 

.80 



.78 



.85 





.50 

.82 


.57 

.72 


.52 

.82 


YES 


.49 

. 20 

.75 

.45 

.09 

.74 

.48 

.20 

.75 


11 

.85 



.98 



.82 





.40 

.81 


-.07 

.96 


.56 

.80 


YES 


-.05 

. 60 

.70 

-.58 

.49 

. 68 

-.09 

,71 

,62 


12 

.95 



.89 



.95 





.84 

.46 


.60 

.71 


.34 

.46 


YES 


.74 

.28 

.56 

. 65 

.41 

.51 

.63 

.51 

.69 
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are significantly different from the stationary [0] rnatrirc 
derived for a stationary model. Let be the Z statistic 
estimated for the j-th season (Subsec. 5.2,2). and Z 
are approximately distributed according to chi square distri- 
bution and with the same number of degrees of freedom K, 

Hence the statistic F given by Z^/Z v/ill be approximately 
distributed acco rding to P distribution with (K, K) degrees 
of freed on. If the estimate of P exceeds the theoretical 
value at, say, 95 5^' confidence level, then the hypothesis 
that P is significantly different from unity is accepted. 

This is considered to imply that the [C^] matrix is statis- 
tically different from the [C] matrix at the same level, 

5*3.2 Significance tests on parameters 

Validation of the nonatationary model was done using 
the procedure of Subsec. 5.2.2. The [D^][D^]^ matrix was 
tested for each j for positive definiteness, statistic 
was also estimated for all j and used to test whether the 
[O^j matrices were significant for each j. The P statistic 
was used to test whether the [0^] matrices of the nonstationary 
first order model were significantly different from the 
[C] raatT^ix of the stationary first order model, Por a 
zeroeth order stationary model the test on P statistic is 
inapplicable . 
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Monthly series: matrices were found to 

be positive definite for all months of the- year. The results 
of the tests on [G^] matrices are given in Table 5.7. They 
indicate that in about 7 of the 12 months, [G^J is signifi- 
cantly different frcm zero. Results of the F test show 
that the above [C*^] matrices are not sigiif icantly different 
from the [G] matrix of the stationaiy first order model 
(Sec. 5.2). 

Tendaily series: The results of the testsu on 
ten daily series are given in Table 5.8. matrices 

were not positive definite^ for 4 out of 36 seasons. As 
indicated in Subsec. 5.3*1? during these seasons, the 
simpler zeroeth order multivariate model was used. The test 
for Z statistic is not applicable for these seasons. Results 
of the Z test showed that, out of the remaining 32 seasons, 

[C^] was found to be signif icantly different from zero during 
11 seasons. Results of the P test showed that the [C'^] matrices 
for the nonstationary first order model were not significantly 
different frcm the [C] matrix of the stationary first order 
model • 

5.3.3 Validation of the fitted nonsta tionaiy models 

Normality of residuals: The results of the chisquare 
test ‘ on the residuals for the monthly series are given in 
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TABLE 5.8 2 AND F STATISTICS FOB BONSTATIOJ^ABY 

imTIVABlATE MODEL ( T3NDAILY SERIES) 


SEA- 

SOB 

^EST 

F 

•^EST 

ORDER 

OP 

MODEL 

SEA- 

SON 

^BST 

^EST 

OBDBB 

OF 

MODEL « 

1 

1.3 

0.2 

1 

19 

4.4 

0.8 

1 

2 

3.7 

0.6 

1 

20 

3.7 

0 . 6 

1 

3 

7.3 

1.2 

1 

21 

21. P'*' 

3-5 

1 

4 

2,6 

0.4 

1 

22 

2.1 

0.4 

1 

5 

4.3 

0.7 

1 

23 

10.8‘‘' 

1.8 

1 

6 

12. r 

2.0 

1 

24 

o 

• 

o 

0.0 

0 

7 

0.9 

CM 

• 

o 

1 

25 

H 

00 

3.1 

1 

8 

s.e’'' 

1.4 

1 

26 

2.4 

0.4 

1 

9 

3.0 

0.5 

1 

27 

19. e'*' 

3.3 

1 

10 

1.4 

0.2 

1 

28 

13. 2'*’ 

2.2 

1 

11 

0.0 

0.0 

0 

29 

•4^ 

• 

ro 

0.7 

1 

12 

26 . 8 ^ 

4.8 

1 

30 

0.0 

0.0 

0 

13 

0.4 

0.1 

1 

31 

5.1 

0.9 

1 

14 

2.7 

0.6 

1 

32 

49.0''' 

8. 2 

1 

15 

16. 

2.8 

1 

33 

19.5'^ 

3.3 

1 

16 

5.4 

0.9 

1 

34 

1.7 

0.3 

1 

17 

4.0 

0.7 

1 

35 

4.5 

0.8 

1 

18 

6.2 

1.0 

1 

36 

0.0 

0.0 

0 


+ indicates significance at 95^^ level 

a First order model used if [DD^] is positive definite 
otherwise zeroeth order model used. 
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Table 5.9* They may be compared with the c oire spending 
results for a stationary model (Table 5*5) and the univa- 
riate residuals (Table 4*7)* The results of the chisquare 
test on the residuals of the nonatationary multivariate 
model for the tend ally series are given in Table 5.10, 


TABLE 5.9 ESTIMATES OF MULTIVARIATE MONTHLY RESIDUALS 

(ROUSTATIONARY FIRST ORDER MODEL) 


NUMBER OF SEASONS WHEN HYPOTHESIS IS 
REJECTED AT 95^ LEVEL 


TABLE 


RIVER 

MM 

STANDARDISATION 

TiLE 

BY 

MLS 

1 

2 


2 

2 

2 

0 


0 

0 

3 

0 


0 

0 

.0 ^ BSTIMAOIES 

OF MULTIVARIATE TENDAILY 

RESIDUALS 

(NOHSTATIONAET MODEL) 

NUr-lBER OP 

SEASONS WHEN 

HYPOTHESIS IS 

REJECTED 

AT 95 fo LEVEL 



RIVER 


STANDARDISATION BY 



MM 

MLE 

MLS 

1 


6 

6 

5 

2 


1 

1 

1 

3 


2 

2 

3 
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They may be compared with the corresponding results for the 
residuals of the stationary model (Table 5.6) and the uni- 
variate residuals (Table 4.7). The results indicate that 
for both monthly and tendaH-y data? the residuals frcm a 
nonstationary raultiTO.riate model are more nearly normally 
distributed than the residuals of the stationary multivariate 
model or the univariate residuals. 

Independence within the time series s The correlogram 
and spectral analyses indicate that each of the multivariate 
residual series may be considered as pure random- Furthermore, 
the seasonal serial correlation coefficients are estimated 
for each series for each season and are shown in Fig, 5*10 
for the monthly series and Fig. 5.11 for the tendaily series- 
In either case, they are not statistically significant at 
95?^ level. Hence the residual aeri^ from the nonstationary 
multivariate model are not serially and seasonally correlated. 

Independence among the thne series’. The crosscorrelation 
coefficients between any pair of the multivariate residual 
time series are estimated for each season. The results for 
the monthly series (MLS) are given in Fig, 5.12. They may 
be compared with the corresponding 3re suits for the stationaiy 
model (Fig. 5*6). For the monthly series, the crosscorrelation 
is statistically not significant at 95^ confidence level for 
the nonstationary model while it is significant for about 10 
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of the cases of the stationary model, For the tendaily 
series (Fig. 5.13) there is a significant reduction (to 
almost one) in the number of seasons vrith significant 
seasonwise crosscorrelation * The nons tationary model is 
found to be superior to the stationary model for the 
monthly and tendaily series and hence is to be preferred, 

5 • 4 Gone lus ions ; 

On the basis of the multivariate analysis the 
following conclusions are derived! 

i) A stationary multivariate first order AR model 
can be fitted to the annual data. The results show that 
the residuals are not correlated within themselves, among 
themselves and seasonally. Further they have a nearly 
normal distribution, 

ii) A stationary multivariate zei'oeth order AH model 

can be fitted to the monthly and tendaily data. The fittin^ 
of the stationary model involves the estimation of a large 
number of parameters, but they are estimated generally from 
a large sample of data. The estimates are hence reliable. 

By decoupling the variances, the parameters are estimated 
in two phases. Hence the estimates have a reliability 
comparable to those of the respective univariate and multi- 
variate models. The residuals of the stationary model are 
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not serially correlated and do not exhibit significant 
coherence^ However a significant seasonwise crosscorrelation 
between the residuals is indicated in the case of 10 to 15 
of the seasons. Since this violates the assumption of inde- 
pendence of residuals, a stationary model is not considered 
satisfactory. 

iii) A nona tation ary multivariate first order AR model 

can be fitted to the monthly data. For nonstatlonaiy 
modelling, the number of parameters estjmated is q.uite 
large. For example, for the case studied, for the monthly 
data, the sample consists of 900 observations and the total 
number of parameters estimated are 252 including mean, 
standard deviation and autoregressive coefficient for each 
river during each season and three elements of C^q] ^ind 9 
elements of for each season. Since the parameters are 

similar to those of the annual series and the sample sizes 
are also comparable, the accuracy of the results may be 
considered to be comparable to those of the annual series- 
The results show that the residuals are not correlated 
within themselves, among themselves and seasonally. Further 
they have a nearly normal distribution. So the nonstationary 
multivariate model seems preferable to the stationary mode"' 

iv) In the case of ten daily data, it is not possible to 
fit the nonstationary multivariate first order AR model for 
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4 out of 36 seasons. This is because [D^][D’^]'^ for 
these seasons is not positive definite. In such oases, 
it is necessary to either use other suitable models or to 
use special procedures to take .care of the inconsistencies- 
While a zeroeth order multivariate model may be fitted in 
such cases, the [M^^] matrix is not preserved and this 
may lead to inconsistency of the model taken as a whole. 



I'’LOV( POHaCASTIITG MI) liSTIMATIOW 


r 

O. 

6.1 General 

A stochastic model can be usei- to infer the range 
of future values based on past records. Forecasting consists 
in deriving the conditional distribution of the future 
observations from the pa,st observations. It includes the 
determination at' the characteristics oi' that distribution, 
viz., the conditional expectation (mean) referred to herein 
as the ’forecast’ of any future observation and its variabi- 
lity about the forecast. It ca.n be siiov/n that the conditional 
expectation has the property of being the minimum mean square 
error forecast. When ostimates of tho present or past value 
of one variable ivS to be made, on the basis of earlier 
records of itself only or with concurrent and past records 
of other variables as well as in fitting up of voids in 
data series , it is referred to as estimation. 

Consider a process x(t). A n-step ahead (lead time 
n) forecast implies the determination of x(t-tn) given the 
present and past values of the process, -x(t), x(t~l), x(t-2) 
etc. in the case of univariate forecasting and- correspond- 
ing values of related time series in the case of multivariate 
forecasting. Hence the forecast x^(t+n) is given by 
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}i[x(t+n) .x(t), x(t-l), ]. It is assumed that the 

present and past values of the time series are available 
as observations. If n = 1, the foreca.st is referred to as 
the one-step-ahead forecast. The error of a forecast is 
given by the difference between the forecasts and actual 
observations. It is generally assumed that the error is 
normally distributed. Hence confidence liraits for the 
forecasts can also be specified. The confidence limits 
express the uncertainity in forecasts and the wider apart 
they are, the less reliable is the forecast. In addition, 
since the distant future is more uncertain than the imme- 
diate, it will be seen that the confidence interval 
increases with the lead time. 

In the case of a pure random process, the future 
values do not depend on the present or past values and 
hence there is no advantage in knowing them. On the other 
hand, in the case of serially and mutually dependent processes, 
the deterministic component due to internal and external 
dependence can be estimated from the present and past values 
of the time series and the error or estimate is correspond- 
ingly reducod. Hence by using the univariate and multivaricite 
time series models, it is possible to forecast the future 
values with smaller forecasting errors for a given level of 
confidence. In this study, only one-step-ahead forecasting 
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is considered. Fur’ohermore, forecasting is done only for 
the annual and monthly series. 

Forecasting generally involves (i) the identification 
and/or postulation of a model (ii) esti.iation of the model 
parameters (iii) validation of the assumed model and, if 
necessary, re postulation of a different model for valida- 
tion (iv ) adoption of the model for forecasting and (v) compa- 
rison of forecasts with observations. Steps (i) to (iii) 
have already been done in the earlier chapters and the 
results are used for steps (iv) and (v) in this chapter. 

5 . 2 Fore c as ting ?Iod els 

6.2.1 Univariate models 

The following univariate time series models have been 
adopted (Chapter 4) for the normalised standardised series. 

i ) Annual series 

y.(t) = E(t) (6.1) 

ii) Monthly series s 

x(t) = x(t-l) + E(t) (6.2) 

when standardisation is done by MLE; and 

x(t) = 0 ^+ 0 ^ x(t-l) + £{t) (6.3) 

when standardisation is done by MI and MLS, with 
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= E[x(t) ix(t-l)y x(t~2), ,,,] 
is as fo3j,ows • 

Annual series : 

( t ) = 0 (6,4) 

Monthly series s Eor standardisabion by riLE 

Xj(t) = (^1 x(-t-l) (6.5) 

and for standardisation by MM and MLS 

x^(t) = ^0 (6.6) 

The standard error of the forecast is given by s^, 

s 

the standard deviation of a(t) for sea-son j. The x^(t) values 
may then be destandardised to yield y.p(t) values. 

Annual series? 

y^(t) = x^(t) s^ 4- y, and, (6.7) 

Q^(i) = yf(t) (6.8) 

Assuming normality of errors the 95?^ confidence 
limits are y^(t) 1.96 s^ . 

Monthly series? De standard! sat ion of x.p.(t) into y^(t) gives 

yj.(t) = Xf(t) s| + y“ (6.9) 

The corresponding standard error and 95% confidence limits 
are, respectively, 
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SE^ = and (6.10) 

CL = y^Ct) + 1.96 SE^ (6,11) 

The forecasts y^(t) can be transformed back to the original 
domain using Eq, 5,5 and 3.6. 

6,2,2 Decoupled multivariate models 

The model adopted consists of univariate models for 
the normalised and standardised data at all the sites to 
prewhiten the time series and a nonstationary multivariate 
first order AR model relating the standardised residuals 
of the univariate models, viz., 

fx(t)]; = I [IZS^] {x(t-i)^ + (e(t)^ (5.2) 

jE(t)1| = •^e(t)/s^ Jand (5.3) 

[E(t)1f = [cj] [B(t-l)} + [D^] {Ti(t)^ (5.5) 

If MLS is used for standardisation, the above expressions are 
to be modified as indicated in Subsec. 6,2.1, 

Combining the above equations, 

/x(t)^ = I [0^] {x(t-i)} + [s^][oj] fE(t-l)] 

+ [a^][D^] {^{t)^ (6.13) 
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Further, as {Y(t)'^ = [B^]{x(t)^ + 

{Y(t)'>5 = {yJ'^+ [s^] J [0j{x(t-i)^ 

+ [s^j[sJ][C'’] /E(t-l)'’s 

+ [S;J][a^][D^] /h(t)3 (6.14) 

In the above, [0^]? [®f] diagonal matrices 

respectively representing the univariate i~th order AR coeffi- 
cient, standard deviation of the transformed flow and the 
residual variance for the j-th reason. Vlithout loss of 
generality, it is assumed that the variable to be forecast 
or estimated is the last or iC-th variable of the K X 1 
vector '^X( t . In multivariate modelling, all flow data 
preceding the current flow at all the sites are known. Hence 
the first 3 terms on the right hand side of Eq, 6,14 repre- 
sent the deteministic multivariate components for forecast 
and the last terra , the random component for forecast. The 
expected value of one -step-ahead forecast for the K-th site 
at time t is given by, 

Xf(t) = (K,K) Xg(-t-i) + s^^(K,K) {ci]fE(t-l)] 

(6,15) 

where refers to the K-th row of the [C^] matrix, via., 

= ^a^(K,i), cJ(K, 2)...a^(K, k)’. 


and x^(t) is x{t) for the K-th site 
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The standard error of forecast is given hy, 

SE^ = (K,K) I 2 [d^'(K,n)2]^^ (6.16) 

I L n~l 

where d‘^(K, n) is ihe (K, n)'"th element of the [D^] matrix. 

Annual series s A stationary multivariate first order 
AR model has been fitted. As the univariate model is a pure 
random series^ the i) tejrms are zero. The foreoast^its 
standard error and 95^ confidence limits are given respeo*- 
tively by 

x^(t) = a^(ic,K:) {E(t-i)]- (ea?) 

and f K „ n i 

SE^ = Eg (K.K) is (K,rL)\'^ and 

(- 11=1 ■> 

CL = x.(t) + 1.96 SJJ (6.18) 

I - Xj 

Monthly series* For the' stationary multivariate model 
for the monthly series, the [C] matrix is not significantly 
different from zero. Forecasting by the multivariate model, 
then, is the same as that by the univariate model, (See 
Subsec, 6.2.1). For the nonstationary multivariate model, the 
forecast and Its standard error are given by 

Xf(l:) = (K,IC) Xjj(t-l) + s^(K,K) {c!j^^{E(t-l)] 

and 

SE,^, = sJ(E,E) [ s, (d^(K, n))"^]^ (6.19) 

f n=l 


1?3 


6,2, 3 Forecasting 

Using the models developed in, and the results of, 
the previous chapters, one -step-ahead forecasts were made for 
station 3 for the annual and monthly series. Univariate 
f orec as ting of annual flows requires no previous data, 
Multivariate forecasts of annual flows and multivariate 
forecasts of monthly flows using the nonstationaiy model 
require the present value of the variable at all the sites . 
Using the methods described in Subsec. 6.2.2, the forecast, 
its standard error and the 95?^ confidence limits were 
calculat ed . 

AnnuaJ. series! The forecasts were made for 5 years 
using the univariate model and the multivariate model. They 
are shown in Fig. 6.1 along with the 95^ confidence limits 
on the forecasts. The standard error of forecast for the 
multivariate model is 0.86 as against unity for the univa- 
riate model. The multivariate forecasts are centred around 
points other than the mean which corresponds to univariate 
forecasts, and the confidence limits for the multivariate 
model are narrower than those for the univariate model. The 
actual observations lie within the 95^ confidence limits. 

It may be inferred that multivariate forecasts are better 
th an uni var iate f o re c ast s . 
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Monthly series; The stationaiy multivariate model 
fitted to theunivariate residuals is a. zeroeth order AR 
model and so the forecasts of the sta.tionary model are the 
same as that for the univariate model. Using the historical 
record for one year, forecasts for months 2 to 12 were made 
respectively from the observations for months 1 to 11. The 
standard errors of forecasts are shown in. Fig. 6.2 and the 
forecasts with their respective 95 ^ confidence limits are 
shown in Fig. 6.5. 

The univariate forecasts and multivariate forecasts 
differ from the mean. The standard error for the univariate 
forecast is less than the standard deviation of the variable 
and, in turn, the standard error of the multivariate forecast 
is less than that of the univariate forecast. It is seen that 
the forecast for the fifth month lies outside the confidence 
band for any method of forecast and hence seems to be a rare 
event. The forecasts for other methods are within the 95 9^ 
confidence limits. 

6,5 F^stimation 

6.5.1 Models for estimation 

Univariabe ‘estimation and univariate forecasting are 
identical. In multivariate estimation, in addition to the 
past values of itself and other stations, current values of 
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other stations are also used. The estimate herein, made for 
station is given by, 

+ V d(K,n) r) (t)] 
n=l 

( 6 . 20 ) 

for the annual series, anr. 

Xgg^(t) = (11,11) Xg(t-l) + s^(K,IC)[{c|]-^E(t-l)} 

IC-1 . 

+ 2 d^(K, n) ri^(t)] (6.21) 

n=l 


for the monthly series, where is the n-th element of 

the r)(t) vector. The corresponding standard errors of estimate 
are given respectively by 

SE^ = s^(K,K) d(K,K) (6.22) 

es t 

for the annual series y and 

SB^ = s^(K,IC) d^(K,K) (6.23) 

est ^ 

for the monthly series, 

6,3.2 Results of estimation 

The normalised standardised flow in station 3 is 
estimated using the eq^uations given in Sub sec . 6.3.1* Past 
and current data at stations 1 and 2 as vjell as the past data 
at station 3 are used in the estimation* 
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Annual series : The univariate estimate wMch is 
identical to the forecast is the mean flow whereas the multi- 
variate estimate differs from the univariate estimate and the 
multivariate forecast. Furthermore, the standard error of 
multivariate estimation is 0,4 2 which is smaller than that 
of the multivariate forecast viz., 0.36 vrhich in turn is 
smaller than that of the univariate forecast of unity. The 
forecasts and estimates as well as their 95 confidence 
levels are shown in Fig, 6,1 for a period of 5 years. The 
actual observations lie within the confidence intervals. 

Hence for annual series, multivariate estimation is to be 
preferred to forecasting. 

Monthly series s . The univariate estimate is the same 
as the univariate forecast. The standard errors of univariate 
stationary multivariate and nonstationary multivariate esti- 
mates are shown in Fig, 6,2 along with the standard errors 
of forecasts. They indicate a progressive decrease in the 
standard error in the following orders univariate forecast 
or estimation, nonstation ary multivs-riate forecast, stationary 
multivariate estimation and nonstation ary multivariate estima- 
tion, The data for the same period as in the case of fore- 
casting were used for estimation. The multivariate estimates 
and their 95^ confidence limits for stationary and nonstationar^r 
models are shown in Fig, 6,5. It may be noted that the 
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estimates are different and they, in turn, differ from the 
forecasts. It isseen that for month 5? the actual flov; lies 
outside all confidence bands and is a rare event. 

For month 2, the actual flow is outside the confidence 
limits for stationary and nonstationaiy multivariate estima- 
tion and for month 7, it is outvside the confidence limits 
for nonstationary multivariate estimation only. Whether 
these are due to sampling or modelling error needs further 
investigation, Hovrever because of smaller standard error 
of estimate associated with the nonstationary multivariate 
models, they may be preferred to other models* 

6,4 Conclusions 

i) The decoupled model can be used for one- step-ahead 
streamflow forecasting as well as for filling- in of gaps in 
data using data of streamflows at adjacent stations. 

ii) The annual data indicate that a multivariate model is 
useful for forecasting in comparison to a. univariate model 
and that multivariate estimation is preferable to multiva- 
riate forecasting, 

iii) Forecasts and estimates have been made for 11 months 
only. The forecasts indicate the general superiority of 
the nonstationary model over others. 
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iv) The multivariate estimates have a narrower confidence 
band than others and for the limited sample used for testing, 
sometimes result in the actual occurrence being outside the 
confidence band, l^liether this is due to sampling errors 
needs further inves t igs^tion . 


A 



7. summary', OONOLUSIOiTS AND SUGUESTIOHS FOR 

FURTHER STUDY 


7.1 Summary 

The design and operation of a uater resources system 
should he based on a clear understanding of the processes 
affecting them v;hich in turn depend on proper analysis and 
interpretation of data. Particularly important are the 
streamflovr data which exhibit complex stochastic characteris- 
tics. They include j 1. a nonstationary behaviour in terms 
of the seasonal variation of the mean, standard deviation, 
serial dependence, etc.j 2. trends due to natural or human 
influences? 5. periodic or cyclic components; 4. autoregres- 
sion or persistence within the time series; 5» moving average 
or external correlation between random components at the 
site; and 6. correlation in space or among time series. 
Generally factors 1 to 5 are studied in the univariate 
modelling of a time series, say, the streamflow at a site 
and factor 6 is studied in the multivariate modelling of 
several time series. Knowing the dependence or independence 
of streamflows at different sites, it is possible to design 
and operate multi reservoir systems by taking advantage of 
the interdependence. 
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Mathematical modelling- of streamflow* at a site is 
itself fairly complicated* Modelling of multisite stream- 
flows is further complicated by the fact that the multi- 
variate streamflovj variability may be attributed to i ) the 
wi thin-the-statioii variations in terms of factoid 1 to 5^ 
and ii) the among-the-stations variations in terms of the 
crosscorrelations between the streamflows at different 
sites. Univariate models in hydrology deal wiih the foimer 
and multivariate models generally consider both simultaneous- 
ly. It seems possible to decouple the variability due to 
these two classes so that the within- the-station variability 
can be first estimated by univariate modelling and used for 
identifying the serially independent rendom components at 
each sitej and then these random components can be related 
through a multivariate model to account for the inter^station 
correlations and the among the stations variability. Such 
an approach seems to be advantageous because of the following; 

1) The expertise developed in the past in univariate 
time series modelling can be beneficially used. 

2) As the serially independent aiid normally distributed 
variates are used in the multivariate models a better 
performance of the model can be expected. 

3) Since the parameters are estimated in stages, the 
number of parameters estimated in. each stage is comparable 
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to that of univariate and multivariate model respectively 
and the problem of simultaneously estimating all the para- 
meters is avoided a 

4 ) Where data lengths vary from station to station? 
all the' available data can be used in the estimation of 
parameters and decoupling may eliminate errors and bias 
that may be present in the simultaneous estimation of 
parameters = 

5) Decoupling of spatial and temporal ■variations has 
been suggested recently by Yevjevichy Mejia and Iturbe ^ 
and Yevjevich and Karplus . Rather than using simple 
spectral and; .correlation procedures to represent spatial 
variability? the use of multivariate models is suggested 
in this stiidy, This is similar to the approaches of Rro^L 
and Clarke and Yevjevich, 

6) Decoupled multivariate models seem to be very power- 
ful tools in representing complex raiO-tiple interrelated 
time series , 

Without loss of' generality, the study was restricted 
to three sites in a river basin in North India for which 
were available and to a 25 year concurrent record of stream- 
flow, Data series considered in this study include i) the 
tendaily historical data ‘ '.series j ii) the monthly data series 
calculated from the tendaily series as the average of the 
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three tendaily values in each month; and iii) the annual 
series, being the average of the monthly values in each 
year. Annual data \rere normally distributed; but monthly 
and ten daily series were normalised by logarithmic trans- 
formation. The data series were standardised to a mean zero, 
unit variance series by means of the sample means and 
standard deviations for each of the periods. Using AKIMA. 
models and a nonlinear least squares regression approach, 
univariate models were fitted to the standardised normalised 
time series at each site. The results indicated autoregres- 
sive models of orders zero, one and three respectively for 
the annual, monthly and tendaily series. The residuals were 
calculated and tested for serial independence. Time domain 
and frequency dmain analyses indicated serial independence 
when the full series was considered; but, for monthly and 
ten daily series the correlation coefficients between the 
residuals of adjacent periods were found to be significant 
in several cases. This indicated a seasonal variation of 
persistence and so nons tationaiy autoregressive models of 
the same order as before but with seasonally varying para- 
meters were fitted to the monthly and tendaily series. The 
residuals from the nonstationaay models were found to be 
serially uncorrelated seasonally and as a whole. The uni- 
variate residuals were also generally normally distributed. 
The normally distributed serially independent residual 
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series are generally referred to as ' pre whitened ' series 
and the process of identifying and separating such residuals 
is referred to as * prewhitening' , Hence the first stage in 
decoupled multivariate modelling involves 'prewhitening' 
the univariate data series. 

The second stage consists in fitting a multivariate 
time series model to the univariate residuals to take into 
account the araong-the -series variation. Using currently 
available procedures (Matalas, Young and Pisano), a sta- 
tionary first order autoregressive multivariate model was 
fitted to the univariate residual series. The multivariate 
residuals were evaluated and tested for independence 
serially and with one another. When considered as a whole, 
the series vrere found to be independent serially and of one 
another? but, monthly and tendaily series showed significant 
crosscorrelation on a seasonal basis in several periods. 

This suggested a nonstationary multivariate model with sea- 
sonally varying parameters. For the tendaily series, for 
four seasons it was not possible to fit nonstationary first 
order autoregressive multivariate models using the procedures 
adopted for stationary modelling. This is because of incon- 
sistencies in the sample correlation matrices and the prob- 
lem was similar to cases mentioned by Mat alas and Wallis. 
Zeroeth order models were fitted to the tendaily series in 
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such cases. Development of procedures for fitting first 
order models and for estimation of parameters in such cases is 
beyond the scope of this study. Monthly data did not exhibit 
any such problems. The multivariate residuals from the 
nonstationary model were estimated and when tested statis- 
tically » they were found to be independent serially and of 
one another, seasonally and as a v/hole. 

It is possible to recouple the multivariate and 
univariate models and also use inverse transformations to 
denorraalise and destandardise them in the proper order so 
that the complex relationships between the actual flows 
can bederived in terras of the parameters estimated earlier. 
These relationships can be used to predict missing data, 
if any; forecast future values of the streamflows at any 
site; and generate multisite streamflow data for simulation 
of complex water resources systems. 

One step ahead forecasting and predictions were 
considered in this study using the multivariate model deve- 
loped earlier. It was seen that generally multivariate 
estimation and forecasting leads to a better definition of 
the expected value a.nd confidence levels of the variable 
concerned than in the case of univariate models. 
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7*2 Oonclusions 

On the basis of this study, the following conclusions 
can be mades 

i) Decoupling facilitates the use of simpler component 
models in the modelling of complex stochastic processes , and 
leads ±to a better understanding of the temporal and spatial 
dependence between streamflows at different sites, 

ii) For the standardised annual series, a zeroeth order 
univariate autoregressive model and a first order multivariate 
autoregressive model are indicated. For the noimalised, 
standardised monthly series, a nons tationajcy first order 
autoregressive univari 0 .te and multivariate model are indicated, 
Tendaily data after normalisation and standardisation, can 

be represented generally by a third order autoregressive 
nonstationaiy univariate model and a first order auto- 
regressive nonstationary multivariate model. For some 
seasons, the sample correlation coefficients for the non- 
stationary model were inconsistent with the assumption of a 
multivariate first order autoregressive model. In such 
cases, a zeroeth order multivariate model was fitted to the 
tendaily series. 

iii ) Multivariate models can be used for the estimation 
of missing values, and for one step-ahead forecasting. 

They generally lead to a better definition of the conditional 
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expectation and a amaller standard deviation of the forecast 
or estimate , 

Da ca can be generated on the basis of the decoupled 
multivariate model and used in simulation analysis of multi- 
reservoir systems. Decoupling' is a very powerful tool in the 
mathematical modelling of interrelated time series and the 
procedure developed herein can be used for modelling other 
multivariate processes. 

7.3 Suggestions for Further Study 

Based on the results of the study, the following 
suggestions are made for future work area; 

Decoupled multivariate models have been used for 
representing multisite streamflows in three rivers of North 
India. The decoupled model consists of nonstationazy 
univariate AB models and zeroeth or first order multivariate 
AR models relating the univariate residuals. It seems 
possible to develop other models in a decoupled framework. 
Investigations should include resolution of inconsistencies 
between data and assumed models, Furtho^mnore, the procedure 
may be applied to other regions of India and abroad in order 

that regional space-time correlations can be identified and 
defined. 
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ii) Decoupled multivariate models have already been used 
to represent input-output relationships in other areas of 
engineering. The component processes of hydrologic cycle 
have been represented through input-output models. Hence 

it seems possible to represent input ••output models in hydrology- 
through decoupled multivariate models, 

iii) Generally, in the representation of complex processes, 
simple models with parsimony in parameters are preferred. A 
decoupled model with simple components may hence be preferred 
to more complicated nondecoupled models. The results of the 
study indicate that the inter station correlation is nonstation- 
ary in at least some seasons and hence a nonstationary multi- 
variate model is required on conceptual grounds. It seems 
necessary to use simulation models and generated data based 

on stationary and nonstationary models to justify on prag- 
matic grounds the need for more complicated nonstationary 
multivariate modelling. 






190 


REFERS J^GSS 


1. Anderson, T*V/, , An Introduction to Multivariate 
Statistical A nal ysis . John V/iley and Sons, New iork, 
1958. 

2. Beard, I.R., *Use of Interrelated Records to Simulate 
StreamfloTrfs' , Journal of the Hydraulics Division . 

ASGE, Vol. 91t No. m:5, 1965, pp. 13-22. 

3. Box, G-.E.P,, and Jenkins, G-.M, , Time Series Analysis , 
Fo recastinis; and Control . Holden-Lay Publishing Co* , 
Sa^ Franscisco , 1970. 

4. Carlson, R.P., McCormick, A.J.A. , and Watts, D. G. , 

* ADulication of Linear Random Models to Four Annual 
Streami'low Series ' , V /ate r Resources Research o V ol. 9 , 
No. 4, 1970, pp. IO70I1073. 

5. Chow, V.T., Handbook of Applied Hydrology . McGraw Hm 
Book Go, , New Toii^: , 19^64 * 

6* Clarke, R.T,, Mathematical Models in Hydrology , Pood 

and AgriculturS Organisation , Rome , 1973. 

7. Cox, D.R. , and Miller, H.D*, The Theory of Stochastic 
Processes . John Wiley and Sons, New York, 1965. 

8. Crosby, L.S., and Maddock III, T,, 'Estimating Coeffi- 
cients of a Plovr Generator for Monotone Samples of 
Data,' W ater Resources Res e arch .. Vol. 6, No. 4, 1970, 
pp. 1079^-108^“: 

9. Peller, Vf. , 'The Asymptotic Distribution of the Ran^ 
of Sums of Independent Randan Variables' , A nnals 0 f 
Mathematical Statistics , Vol. 22, 1951, pp. 427-430. 

10. Piering, M.B. , *A Multimriate Technique for Synthetic 
Hydrdogy' , Journal of the Hydr^lics Division . ASGE, 
Vol. 90 , No.‘hY 5, 1^4, “pp. 43-60. 

P i er i ng , M . B . , Stre amf 1 ow S.vnth es i 3. Harvard Uni ve rs i ty 
Press, Cambridge, Massachusetts, 1967. 


11 



191 


12* Fiering, M.B,, ’Schemes for Handling Inconsistent 
Matrices Vfater Beaources R esearch , Vol. 4? ^o, 2, 

1968, pp. 291-297, 

13. Frost, J, , and Clarke, R.T,, ’Use of Gros scorrelation 
Between Hydrologic Time Series to Improve Estimates 

of lag one Autoregressive Parameters’ , W ater Resourc es 
R esearch , Vol. 9, No, 4, 1973, pp. 906-'917. 

14. Fuller, G.A. , 'Inconsistent I-Iatricesin Estimating 
Ungaged Streajnflow lata’ , y/ate r Resources Research . 

Vol. 10, No. 6, 1974, pp. 1249-1250.'' 

15. Garcia, l.B» , Bawdy, l.R. , and Hejia, J.M. , 'Long 
Memory Monthly Stream flow Simulation by A Broken 
line Model', V/ater Resources Research , Vol. 8, No. 4? 

1972, pp. 1100-1105. 

16. Granger, C.V/.J. , and Hatanka, M. , Spectral Analysis of 
Econo m ic Time Series . Princeton University Press, 

Princet on , New Jers e y , 1964. 

17. Hamlin, M. , Unpublished Report on the Workshop on 
Stochastic Hydrology, Symposium on the Application 

of Mathematical Models in Hydrology, Bratislava, 1975- 

18. Hurst, H.E,, ’long Term Storage Capacity of Reservoirs', 
Trans actions , American Soci et v of Civil Engineei B. Vo 1 . 116 , 
1951, pp. 770-“799. 

19. langbein, VJ.B* , 'Queuing Theory and Reservoir Storage', 
Proceedings of the Hydraulics Division . ASCE, Vol, 84, 

No. HY5, 1958, pp. 1811-1831. 

20. linsley, R.K. , Kohler, M.A, , and Paulhus , J.l.N, , 

Hydrology for Engineers o LIcGraw Hill Book Go,, New 
York, 1958. 

21. Maass, A., Hufschmidt, M.U, , Dorfman, R. , Thomas, Jr., 

H.A. , Marglin, S. A. , and Fair, G.M. , The Design of 
V Jater Resource Systems . Harvard University Press, 

Cambridge, Massachusetts, 1962. 

22. Mandelbrot, B.B,, 'A Past Fractional Gaussian Noise 
Generator', VJater Resources Research . Vol. 7, No. 3, 

1971, pp. 543-553. 

23. Mandelbrot, B.B,, and Wallis, J.R. , 'Noah, Joseph and 
Operational Hydrology* , Water Resources Research , 

Vol. 4, No. 5, 1968, pp. 909-918. 



19 2 


24. Mandelbrot, B,3. , and V/allis , J.R, , ’Computer 
Experiments with Fractional Gaussian Noises ' , 

V/ater Hesources Research ^ Vol, 5? No. 1, 1969? 
pp. 228-^267. 

25. Matalas , N.G,, ’Mathematical Assessment of Synthetic 
Hydrology’, W ater Resources Research . Yol. 3, No. 4,. 

1967? pp. 931-945. 

26. Mat alas, W.O., and Wallis, J*H, , ’Statistical Proper- 
ties of Multivariate Fractional Noise Processes’ , 

V/ater Resources . Research . Vol. 7, No. 6, 1971, pp.l460- 

1468 , 

27. Ma talas, N.C,, and V/allis, J.N, , ’Correlation Constraints 
for Generating Processes', Proceedings Symposium on 
Mathematical Models in Hydrology . Warsaw, 1972, 

28. McKerchar, A. I., and Delleur, J.W. , ’ Stochasti c 
Analysis of Monthly Flow Data - Application to^'lower 
Ohio River Tributaries , Technical Report No.*" 26, 

V/ater Resources Research Center, Purdue University, 
Lafayette, Indiana, 1972. 

29. McMahon, T.A. , Oodner, G.P. and Philips, 'Single 

and Multisite Operational Hydrology' » N o rdi c Hydrology, 
Vol. 3? No. 4, 1972, pp. 214-238. 

30. Mejia, J.M. , Dawdy, D.R, , and Nordin, C.F. , ' Streamflow 
Simulation, 3. Broken Line Process and Operational 
Hydrology', V/ater Resource s Res earch . Vol. 10, No. 2, 
1974(a), pp. 242-245. 

31. Mejia, J.M., and Iturbe , I.R., ’On the Synthesis of 
Random Field Sampling from the Spectrum. An Application 
to the Generation of Hydrologic Spatial Processes’. 

Wate r Ro s ou rc es Re s e a r^ch , Vol. 10, No. 4? 1974? (b), 
pp. 705-712. 

32. Moran, P.A.P., The ory of Stor age . Methuen, London, 

1959. 

33* Ne 1 s on , 0 , R . , App lied Tim e S eries Analysis for 

Man age ri^ Fo re c as t i n g . Holden- Day ihiblishing Go. , * 

San Francisco 1973 . 

34. Nordenson, T.J. . Application of Conceptual Catchment 
Models to River Forecasting , Te clmi Not e ' ' 9^," ' 

V/o rid l5et e o ro 1 o gi cal ^ r gani s a t i on , Geneva, 1969. 



194 


45- Wallis, , and Matalas, N.O., 'Small Sample 

Properties of H and K, ’ V/ ate r Resources Research » 

Vol. 6, Wo, 6, 1970 , pp, 1583‘-i594. 

46 . Wallis, J.R, , and Matalas N,C*, 'Correlogram Analysis 
Revisited,' Water Resources Research , Vol, 7i. Wo. 6, 
1971 , pp. 1448-1459. 

47 . Ye vd j e vich , V* i'i . , P luc t uat i 0 ns of Wet and Dry Years , 
Part 1 1, Hydrology Paper Wo 2, Colorado State Univer- 
sity, Port Collins, Colorado, 1964. 

48. Yevjevich, V,, Systems Anpi pac h to Hydrology , Water 
Resources Publications, Port Collins, Colorado, 1971. 

49 . Yevjevich, V., P robabi l ity and Statistics in Hydrology , 
V/ater Resources Publications, Port Collins, Colorado, 
1972 (a). 

50 . Yevjcvich, V. , Stochastic Processes in Hydrology . 

Water Resources Publications, Port Collins, Colorado, 
1972(b). 

51 . Yevjevich, V. y Generation Q.f .Hydrologic. Samples . Case 
3 ty^X^Pi the Great Lak es , Hy d r dogy Paper Wo, 7 2 , 

Col o rad o St ate Uni ve rs i ty , 1975. 

52. Yevjevich, V,, and Karplus, A.l'T. , Area~'J?ime Structure 
of the Monthly Precipitation Process , Hy d r ol 0 gy Pap e r 
Wo. 64 , Colorado State University, Port Collins, 
Colorado, 1972 » 

53. Young, G.K., and Pisano, W.C., ^Operational Hydrology 
Using Residuals', Journal of the Hydraulics Division , 
ASCE, Vol. 94, No. HY4, igSaypp. 909 - 923 . 



193 


35* Pentland, R.L., and Cuthbert, D.R. , *Operational 
Hydrology for Ungaged Streams by the G-rid Square 
Technique', Hater Resou rces Res earch , Vol. 7* Ho. 2, 
1971, pp. 283-291. 

36. Qaimpo, R.U., 'Sto chastic Mo del of Daily River Plow 
Sequences ' , Hydrology Paper ITo. 18, Colorado State 
University, Fort Collins, Colorado, 1967* 

37. Ramaseshan, S. , 'Stochastic Modelling of Storm 
Precipitation', |Ioi:<j.i.^Hydjro;LQgy:^ Vol. 2, 1971, 
pp. 109-129* 

38. Ramasoohan,S. , and Krishnasami, H. , 'decoupled 
Stochastic Models for Multn.rjite Strearnflows ' , 

Pro c e e dings , Second Wor ld ^ Congr ess , IV/RA , H ew Re Ihi , 
l975T'Vol. “5, PP- 45-50. 

39. Rao, A, R. , and Relleur, J.W,, The Instantaneous . Unit 
Hydrograoh . Tts Calculation by the Transform Method 
and Noise Con trol by^RigitaaT filtering . Technical 
Report^'Ho'] 20, V/ater Resources Research Center, 

Purdue University, lafayette, Indiana, 1971. 

40. Roesner, L.A., and Yevdjevich, V.M. , ' Mathematical 
Models for the Time Serie s o f Monthly Rainfall and 
Monthly' Runoff, Hydrology Paper No . 16,' Colorado 
State'^'University, Port Collins, Colorado, 1966. 

41. Sariahmed, A., and SCisiel, G.G,, 'Synthesis of 
Sequences of Summer Thunderstorms volumes for the 
Atterbury Watershed in the Tucson Area'. P roceedings , 
Symposium on Usg QP Analog and Ijgital Compute:^ in 
Hydrology . Tucson, Arizona, 1968, Vol. 2, ppl ^39—448 . 

42. Stidd, C.IC. , The n~th Root Noimal distribution of 

Pr e c i pi t at ion , Wa ter Resou rces .R esearch , Vol, 6, Ho. 4, 
1970, pp. 1095-'1103. 

43- Tnyrenin P. . Ano lioa bil ity o f Canon ical Correlati^ 
in H ydro iogy , Hydrology Paper Ho. 58, Colorado State 
Univers ity , Port Collins, Golorrido, 1972, 

44. Valencia, R., and Schaake, Jr., J.O., ' Disaggregation 
Processes in Stochastic Hydrology' , W ater Res^ou^.ea^ 
Research . Vol- 9, Ho, 3, 1973, pp. 580-585. 



/ 

// 


4 


Date Sl ip 



This book Is to bo roturnocj on the 
date last stamped. 


CD 6.72.9 


■E- n'-7 E - D - K R l- - DE 


